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DIAGNOSIS OF HYPERINSULINEMIA AND TYPE II DIABETES AND PROTECTION AGAINST SAME 

This application claims the benefit under 35 USC 119(e) of 
prior U.S. provisional applications 60/460,415, filed April 
7, 2003 (KOPCHICK6 -USA) , and 60/506,716, filed Sept. 30, 
2003 (KOPCHICK6 . 1 -USA) , both of which are hereby 
incorporated by reference in their entirety. 

Cross-Ref erence to Related Applications 

The instant application adds 6 month expression data to 
the disclosure of US Prov. Appl . 60/460,415, filed April 7, 
2 0 03 (KOPCHICK6 -USA) . 

In U.S. Provisional Appl. Ser. No. 60/458,398 (our 
docket Kelderl-USA) , filed March 31, 2003, we describe the 
identification of genes differentially expressed in normal 
vs. hyperinsulinemic, hyperinsulinemic vs. type II diabetic, 
or normal vs. type II diabetic mouse liver. Forward- and 
reverse -substracted cDMA libraries were prepared, clones 
were isolated, and differentially expressed cDNA inserts 
■were sequenced and compared with sequences in publicly 
available sequence databases. The corresponding mouse and 
human genes and proteins were, identified. Favorable 
genes/proteins so identified included (i) np_000767: cytochrome 
P450, subfamily IIIA (niphedipine oxidase), polypeptide 3; (2) AAG31034: 
SYT/SSX4 fusion protein ; and (3) NP_003158: sulfotransf erase family, 
cytosolic, 2A, dehydroepiandrosterone (DHEA) -preferring , member l ; 
sulfotransf erase family 2 A, dehydroepiandrosterone (DHEA) -preferring , 
member i. Unfavorable proteins included (4) np_004884: H2A 
histone family, member Y isoform 2; histone macroH2Al . 2 ; histone 
macroH2Al.l; (5) AAH37738.- Unknown (protein for MGC:33851); (6) 
NP_068839: integral membrane protein 2B ; (7) CAA28659: S-protein ; 
and (8) AAA51560i alpha-l-antichymotrypsin precursor . Mixed 
proteins included (9) NP_000769: cytochrome P450, subfamily IVA, 
polypeptide 11; fatty acid omega -hydroxylase ; P450HL-omega ; alkane-1 
monooxygenase ; lauric acid omega-hydroxylase ; (10) NP_006206 : serine (or 
cysteine) proteinase inhibitor, clade A ; (11) NP_004489: one cut 
domain, family member 1; hepatocyte nuclear factor 6, alpha; and (12) 
MP_775491: liver- specif ic uridine phosphorylase . Gene chip 
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technology was not used. Two of the genes (NM_0 0 7818 and 
NM_007822) were also identified in the present case. 

The use of differential hybridisation to identify genes and 
proteins is also described in our Ser. No. PCT/US0 0/1214 5 (Kopchick 3A- 
5 PCT) , Ser. No. PCT/US00/12366 (Kopchick4A-PCT) , and Ser. No. 60/400,052 

(Kopchick5) . All of the above applications are incorporated by- 
reference in their entirety. 

BACKGROUND OF THE INVENTION 

10 Field of the Invention 

The invention relates to various nucleic acid molecules 
and proteins, and their use in (1) diagnosing 
hyperinsulinemia and type II diabetes, or conditions 
associated with their development, and (2) protecting 

15 mammals (including humans) against them. 

Description of the Background Art 
Diabetes 

Diabetes mellitus is a pleiotropic disease of great 

2 0 complexity. The two major types have been termed type I or 
insulin- dependent diabetes mellitus (IDDM) and type II or 
non- insulin- dependent diabetes mellitus (NIDDM) . Type II 
diabetes is the predominant form found in the Western world; 
fewer than 8% of diabetic Americans have the type I disease. 

25 Type I diabetics are often characterized by their low 

or absent levels of circulating endogenous insulin, .i.e., 
hypoinsulinemia (1) . Islet cell antibodies causing damage 
to the pancreas are frequently present at diagnosis. 
Injection of exogenous insulin is required to prevent 

30 ketosis and sustain life. 

Early Type II diabetics are often characterized by 
hyperinsulinemia and resistance to insulin. Late Type II 
diabetics may be normoinsulinemic or hypoinsulinemic . Type 
II diabetics are usually not insulin dependent or prone to 

35 ketosis under normal circumstances. 



Type II Diabetes 

Type II diabetes (formerly known as non- insulin 
dependent diabetes, NIDDM) is the most common form of 
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elevated blood glucose (hyperglycemia) . Type II diabetes is 
a metabolic disorder that affects approximately 17 million 
Americans. It is estimated that another 10 million 
individuals are "prone" to becoming diabetic. These 
5 vulnerable individuals can become resistant to insulin, a 

pancreatic hormone that signals glucose (blood sugar) uptake 
by fat and muscle. In order to maintain normal glucose 
levels, the islet cells of the pancreas produce more 
insulin, resulting in a condition called hyperinsulinemia . 
10 When the pancreas can no longer produce enough insulin to 

compensate for the insulin resistance, and thereby maintain 
normal glucose levels, Type II diabetes (hyperglycemia) 
results . 

Complications of diabetes (end organ damage) include 
15 retinopathy, neuropathy, and nephropathy (traditionally 
designated as microvascular complications) as well as 
atherosclerosis (a macrovascular complication) . 

Early stages of hyperglycemia can usually be controlled 
by an alteration in diet and increasing the amount of 
20 exercise, but drug treatment, including insulin, may be 

required. It has been shown that meticulous blood glucose '. 
control can often slow down or halt the progression of 
diabetic complications if caught early enough (1) . However, 
tight metabolic control is extremely difficult to achieve. 

25 

Little is known about the disease progression from the 
normoinsulinemic state to the hyperinsulinemic state, and 
from the hyperinsulinemic state to the Type II diabetic 
state . 

3 0 As stated above, type II diabetes is a metabolic 

disorder that is characterized by insulin resistance and 
impaired glucose-stimulated insulin secretion (2,3,4). 
However, Type II diabetes and atherosclerotic disease are 
viewed as consequences of having the insulin resistance 

3 5 syndrome (IRS) for many years (5) . The current theory of 

the pathogenesis of Type II diabetes is often referred to as 
the "insulin resistance/islet cell exhaustion" theory. 
According to this theory, a condition causing insulin 
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resistance compels the pancreatic islet cells to 
hypersecrete insulin in order to maintain glucose 
homeostasis. However, after many years of hypersecretion, 
the islet cells eventually fail and the symptoms of clinical 
5 diabetes are. manifested. Therefore, this theory implies 

that, at some point, peripheral hyperinsulinemia will be an 
antecedent of Type II diabetes. Peripheral hyperinsulinemia 
can be viewed as the difference between what is produced by 
the p cell minus that which is taken up by the liver.. 

10 Therefore, peripheral hyperinsulinemia can be caused by 
increased p cell production, decreased hepatic uptake or 
some combination of both. It is also important to note that 
it is not possible to determine the origin of insulin 
resistance once it is established since the onset of 

15 peripheral hyperinsulinemia leads to a condition of global 
insulin resistance. 

Multiple environmental and genetic factors are involved 
in the development of insulin resistance, hyperinsulinemia 
2 0 and type II diabetes. An important risk factor for the 

development of insulin resistance, hyperinsulinemia and type 
II diabetes is obesity, particularly visceral obesity 
(6,7,8) . Type II diabetes exists world-wide, but in 
developed s'ocieties, the prevalence has risen as the average 

2 5 age of the population increases and the average individual 

becomes more obese. 

Obesity is a serious and growing problem in the United 
States. Obesity-related health risks include high blood 
pressure, hardening of the arteries, cardiovascular disease, 

3 0 and Type II diabetes (also known as non- insulin- dependent 

diabetes mellitus, Type II diabetes) (9,10,11) . Recent 
studies show that 85% of the individuals with Type II 
diabetes are obese (12) . 

3 5 Growth Hormone 

Growth hormone has many roles, ranging from regulation 
of protein, fat and carbohydrate metabolism to growth 
promotion. GH is produced in the somatrophic cells of the 
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anterior pituitary and exerts its effects either through the 
GH- induced action of IGF- I, in the case of growth promotion, 
or by direct interaction with the GHR on target cells 
including liver, muscle, adipose, and kidney cells. 
5 Hyposecretion of GH during development leads to dwarfism, 
and hypersecretion before puberty leads to gigantism. In 
adults, hypersecretion of GH results in acromegaly, a 
clinical condition characterised by enlarged facial bones, 
hands, feet, fatigue and an increase in weight. Of those 
10 individuals with acromegaly, 25% develop type II diabetes. 
This may be due to insulin resistance caused by the high 
circulating levels of GH leading to high circulating levels 
of insulin (Kopchick et al . , Annual Rev. Nutrition 1999. 
19 :437-61) . 

15 A further mode of GH action may be through the 

transcriptional regulation of a number of genes contributing 
to the physiological effects of GH. 



2 0 Transgenic Mice 

McGrane, et al . , J. Biol. Chem. ; 2 63 : 11443 -51 (1988) and 
Chen, et al . , J. Biol. Chem., 269:15892-7 (1994) describe 
the genetic engineering of mice to express bovine growth 
hormone (bGH) or human growth hormone (hGH) , respectively. 

25 These mice exhibited an enhanced growth phenotype . They 
also developed kidney lesions similar to those seen in 
diabetic glomerulosclerosis, see Yang, et al . , Lab. Invest., 
68:62-70 (1993). Ogueta, et al . , J. Endocrinol., 165: 321-8 
(2000) reported that transgenic mice expressing bovine GH 

30 develop arthritic disorder and self -antibodies . 

Growth hormone genes and the proteins encoded by them 
can be converted into growth hormone antagonists by 
mutation, see Kopchick USP 5,350,836. Transgenic mice have 
been made that. express the GH antagonists bGH-G119R or hGH 

35 G120R, and which exhibit a dwarf phenotype. Chen, et al . , 
J. Biol. Chem., 263:15892-7 (1994); Chen, et al . , Mol . 
Endocrinol, 5:1845-52 (1991); Chen, et al . , Proc . Nat. Acad. 
Sci. USA 87:5061-5 (1990). These mice did not develop 
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kidney lesions. See Yang (1993), supra . 

Chen, et al . , Endocrinol, 136:660-7 (1995) compared the 
effect of streptozotocin treatment in normal nontransgenic 
mice, and in mice transgenic for (1) a GH receptor 
antagonist, the G119R mutant of bovine growth hormone or (2) 
the E117L-mutant of bGH . (According to Chen's ref . 24, 
these large GH transgenic streptozotocin-treated mice 
constitute an animal model for diabetes.) 
Glomerulosclerosis was seen in diabetic (STZ- treated) 
nontransgenic mice and in diabetic bGH-E117L mice, but not 
in diabetic bGH-G119R (GH antagonist) mice. 

Two of the proteins which mediate growth hormone 
activity are the growth hormone receptor and the growth 
hormone binding protein, encoded by the same gene in 
mice (GHR/BP) . It is possible to genetically engineer- mice 
so that the gene encoding these proteins is disrupted 
( "knocked- out" ; inactivated), see Zhou, et al . , Proc . Nat. 
Acad. Sci. (USA), 94:13215-20 (1997). Zhou, et al . 
inactivated the GHR/BP gene by replacing the 3 ' portion of 
exon 4 (which encodes a portion of the GH binding domains) 
and the 5' region of intron 4 with a neomycin gene- cassette . 
The modified gene was introduced into the target mice by 
homologous recombination. Like mice expressing a GH 
antagonist, homozygous GHR/BP -KO mice exhibit a dwarf 
phenotype. GHR/BP-KO mice, made diabetic by streptozotocin 
treatment, are protected from the development of diabetes- 
associated nephropathy. Bellush, et al . , Endocrinol., 
141:163-8 (2000) . 

Di fferentia.l/Sub tra c ti ve Hybri di za ti on 

Zhang, et al., Kidney International, 56:549-558 (1999) 
identified genes up-regulated in 5/6 nephrectomized 
(subtotal renal ablation) mouse kidney by a PCR-based 
subtraction method. Ten known and nine novel genes were 
identified. The ultimate goal was to identify genes 
involved in glomerular hyperf iltration and hypertrophy. 

Melia, et al . , Endocrinol., 139:688-95 (1998) applied 
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subtractive hybridization methods for the identification of 
androgen- regulated genes in mouse kidney. The treatment 
mice were dosed with dihydrotestosterone , an androgen. 
Kidney androgen-regulated protein gene was used as a 
5 positive control, as it is known to be up-regulated by DHT. 

See also Holland, et al . , Abstract 607, "Identification 
of Genes Possibly Involved in Nephropathy of Bovine Growth 
Hormone Transgenic Mice" (Endocrine Society Meeting, June 
22, 2000) and Coschigano, et al . , Abstract 333, 
10 "Identification of Genes Potentially Involved in Kidney 

Protection During Diabetes" {Endocrine Society Meeting, June 
22, 2000) . 

The following differential hybridization articles may 
also be of interest: 
15 Wada, et al . , "Gene expression profile in 

streptozotocin- induced diabetic mice kidneys undergoing 
glomerulosclerosis", Kidney Int , 59:1363-73 (2001); 

Song, et al . , "Cloning of a novel gene in the human 
kidney homologous to rat muncl3S: its potential role in 
20 diabetic nephropathy", Kidney Int., 53:1689-95 (1998); 

Page, et al . , "Isolation of diabetes-associated kidney 
genes using differential display", Biochem. Biophys. Res. 
Comm., 232:49-53 (1997). 

Peradi, "Subtractive hybridization claims: An efficient 

2 5 technique to detect overexpressed mRNAs in diabetic 

nephropathy," Kidney Int. 53:926-31 (1998). 

Condorelli, EMBO J. , 17:3858-66 (1998). 

See also WO00/66784 (differential hybridization 
screening for brown adipose tissue) ; PCT/US00/12366, filed 

3 0 May 5, 2 000 (differential hybridization screening for 

liver) . 

Identification of genes involved in hyper insulinemia and 
type II diabetes 

35 High-fat diets have been shown to induce both obesity 

and Type II diabetes in laboratory animals (13) . Surwit and 
colleagues demonstrated that male C57BL/6J mice are 
extremely sensitive to the diabetogenic effects of a high- 
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fat diet when initiated at weaning. At six months of age, 
high-fat fed animals had significantly elevated fasting 
blood-glucose and insulin levels and also demonstrated a 
decrease in insulin sensitivity (14) . Ahren and colleagues 
5 (15) reported evidence of insulin resistance as well as 

diminished glucose-stimulated insulin release, after feeding 
with a high- fat diet for 12 weeks. These mice also showed 
elevated levels of total cholesterol, triglycerides, and 
free fatty acids, another hallmark of Type II diabetes. 

10 Our attention recently has focused on the generation of 

liver. mRNA expression profiles and the identification of 
genes involved in the genesis of the obesity- induced 
hyperinsulinemia and type- II diabetes. To date, no one has 
attempted to study the actual progression from the normal 

15 condition to that of hyperinsulinemia or from 

hyperinsulinemia to Type II diabetes in an attempt to 
identify genes that are up-regulated or down- regulated as 
the disease progresses. 

In previous studies aimed at identifying genes involved 

20 in diabetes- induced glomerulosclerosis, differential display 
and traditional subtractive hybridization techniques were 
used (16-20) . While effective for the identification of a 
few genes (e.g. hmuncl3, PED/PEA-15, lactate dehydrogenase, 
amiloride sensitive sodium channel, ubiquitin-like protein, 

2 5 mdr 1, and a-amyloid protein precursor as well as a few 

novel genes), these techniques can be quite labor intensive. 
The PCR-based method of subtractive hybridization requires 
less starting material, and allows the simultaneous 
isolation of all differentially expressed cDNAs into two 

3 0 groups (up-regulated and down- regulated) . 

However, the PCR-based method of subtractive 
hybridization is also quite labor-intensive, produced large 
numbers of false positive candidates and ultimately resulted 
in the identification of a relatively limited number of 
35 differentially expressed genes, (see Kelderl-USA 
application) . 

In order to expand the number of genes that can be 
analyzed simultaneously, several groups have begun to 
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utilize DNA microarray analysis to measure differences in 
gene expression between normal and diseased states. 
However, these experiments have been limited in regards to 
the number of experimental conditions analyzed. DNA 
5 microarray analysis has been performed on normal, obese and 
diabetic mice (21) . Also, the obesity and diabetes in the 
mouse models examined were caused by a specific endogenous 
genetic mutation (22) . The differentially expressed genes 
in the above models may be very different from genes 
10 differentially expressed due to diet-induced obesity and 
Type- II diabetes. 



WO 20(14/092416 



PCT/IIS2004/0KI191 



10 

SUMMARY OF THE INVENTION 

Differential hybridization techniques have been used to 
identify mouse genes that are differentially expressed in 
mice, depending upon their development of hyperinsulinemia 
5 or type II diabetes. 

In essence, complementary RNA derived from normal mice, 
or mouse models of hyperinsulinemia or type II diabetes, was 
screened for hybridization with oligonucleotide probes each 
specific to a particular mouse gene, each gene in turn 

10 . representative of -a particular mouse gene cluster (Unigene) . 
Mouse genes which were differentially expressed (normal vs. 
hyper insul inemi c , hyper insulinemic vs. diabetic, or normal 
vs. diabetic), as measured by different levels of 
hybridization of the respective cRNA samples with the 

15 particular probe corresponding to that mouse gene) were 
identified. Related human genes and proteins were 
identified by sequence comparisons to the mouse gene or 
protein. 

2 0 After identifying related human genes and proteins, one 

may formulate agents useful in screening humans at risk for 
progression toward hyperinsulinemia or toward type II 
diabetes . 

Since the progression is from normal to 
25 hyperinsul inemi c, and thence from hyperinsulinemic to type 

II diabetic, one may define mammalian subjects as being more 
favored or less favored, with normal subjects being more 
favored than hyperinsulinemic subjects, and hyperinsulinemic 
subjects being more favored than type II diabetic subjects. 

3 0 The subjects' state may then be correlated with their gene 

expression activity. 

Thus, "favorable" human genes/proteins are defined as 
those corresponding to mouse genes which were less strongly 
expressed in mouse hyperinsulinemic liver than in control 
35 liver, or less strongly expressed in mouse type II diabetic 
liver than in hyperinsulinemic liver. (The control liver is 
the liver of a mouse which is normal vis-a-vis fasting 
insulin and fasting glucose levels . The term "normal" , as 
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used herein, means normal relative to those parameters, and 
does not necessitate that the mouse be normal in every 
respect.) Likewise, one may define "unfavorable" human 
genes/proteins as those corresponding to mouse genes which 
5 were more strongly expressed in mouse hyperinsulinemic liver 
than in control liver, or more strongly expressed in mouse 
type II diabetic liver than in hyperinsulinemic liver. 

As used herein, the term "corresponding" does not mean 
identical, but rather implies the existence of a 

10 statistically significant sequence similarity, such as one 
sufficient to qualify the human protein or gene as a 
homologus protein or DNA as defined below. The greater the 
degree of relationship as thus defined (i.e., by the 
statistical significance of each alignment used to connect 

15 the mouse cDNA to the human protein or gene, measured by an 
E value), the more close the correspondence. The connection 
may be direct (mouse gene to human protein) or indirect 
(e.g., mouse gene to human gene, human gene to human 
protein) .By "mouse gene", we mean the mouse gene from which 

2 0 the gene chip DNA in question was derived. 

In general, the human genes/proteins which most closely 
correspond, directly or indirectly, to the mouse genes are 
preferred, such as the one(s) with the highest, top two 
highest, top three highest, top four highest, top five 
25 highest, and top ten highest E values for the final 
alignment in the connection process. The human 
genes/proteins deemed to correspond to our mouse cDNA clones 
are identified in the Master Tables. 

A human gene/protein corresponding to a mouse cDNA 

3 0 which was more strongly expressed in hyperinsulinemic liver 

than in either normal or type II diabetic liver (i.e., C<HI , 
HI>D) will be deemed both "unfavorable", by virtue of the 
control : hyperinsulinemic comparison, and "favorable", by 
virtue of the hyperinsulinemic : diabetic comparison. This is 
35 one of several possible "mixed" expression patterns. 

Thus, we can subdivide the "f avorables" into wholly and 
partially f avorables. Likewise, we can subdivide the 
unf avorables into wholly and partially unf avorables . The 
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genes/proteins with "mixed" expression patterns are, by- 
definition, both partially favorable and partially 
unfavorable. In general, use of the wholly favorable or 
wholly unfavorable genes'/proteins is preferred to use of the 
5 partially favorable or partially unfavorable ones. 



Agents which bind the "favorable" and "unfavorable" 

10 nucleic acids (e.g., the agent is a substantially 

complementary nucleic acid hybridization probe) , or the 
corresponding proteins (e.g., an antibody vs. the protein) 
may be used to evaluate whether a human subject is at 
increased or decreased risk for progression toward type II 

15 diabetes. A subject with one or more elevated, "unfavorable" 
and/or one or more depressed "favorable" genes/proteins is 
at increased risk, and one with one or more elevated 
"favorable" and/or one or more depressed "unfavorable" 
genes/proteins is at decreased risk. One may further take 

20 into account whether the subject is normoinsulinemic or 

hyperinsulinemic at the time of the assay. If the subject 
is non-diabetic and normoinsulinemic, we are especially 
interested in the "favorable" and "unfavorable" 
genes/proteins corresponding to mouse genes differentially 

25 expressed in hyperinsulinemic vs. normal livers. If the 
subject is already hyperinsulinemic, yet non-diabetic, we 
are especially interested in the "favorable" and 
"unfavorable" genes/proteins corresponding to mouse genes 
differentially expressed in type II diabetic vs. 

30 hyperinsulinemic livers. 

The assay may be used as a preliminary screening assay 
to select subjects for further analysis, or as a formal 
diagnostic assay. 

35 The identification of the related genes and proteins 

may also be useful in protecting humans against these 
disorders . 

Thus, Applicants contemplate: 
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(1) use of the "favorable" mouse DNAs of the Master 
Tables (below) to isolate or identify related human DNAs; 

(2) use of human DNAs, related to favorable mouse DNAs, 
to express the corresponding human proteins ; 

(3) use of the corresponding human proteins (and mouse 
proteins, if biologically active in humans) , to protect 
against the disorder (s) ; 

(4) use of the corresponding mouse or human proteins, 
or nucleic acid probes derived from the mouse or human 
genes, in diagnostic agents, in assays to measure 
progression toward hyperinsulinemia or type II diabetes, or 
protection against the disorder (s), or to estimate related 
end organ damage such as kidney damage; and 

(5) use of the corresponding human or mose genes 
therapeutically in gene therapy, to protect against the 
disorder (s) . 

Moreover Applicants contemplate: 

(1) use of the "unfavorable" mouse DNAs of the Master 
Tables to isolate or identify related human DNAs,- 

(2) use of the complement to the "unfavorable" mouse 
DNAs or .'related human DNAs, as antisense molecules to 
inhibit expression of the related human DNAs ; 

(3) use of the mouse or human DNAs to express the 
corresponding mouse or human proteins; 

(4) use of the corresponding mouse or human proteins, 
in diagnostic agents, to measure progression toward 
hyperinsulinemia or type II diabetes, or protection against 
the disorder (s), or to estimate related end organ damage 
such as kidney damage; 

(5) use of the corresponding mouse or human proteins in 
assays to determine whether a substance binds to (and hence 
may neutralize) the protein; and 

(6) use of the neutralizing substance to protect 
against the disorder (s). 

The related human DNAs may be identified by comparing 
the mouse sequence (or its AA translation product) to known 
human DNAs (and their AA translation products) . If this is 
unsuccessful, human cDNA or genomic DNA libraries may be 
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screened using the mouse DNA as a probe . 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS OF THE 
INVENTION 



Subj ects 

5 A mouse is considered to be a diabetic subject if, 

regardless of its fasting plasma insulin level, it has a 
fasting plasma glucose level of at least 190 mg/dL. A mouse 
is considered to be a hyperinsulinemic subject if its 
fasting plasma insulin level is at least 0.67 ng/mL and it 

10 does not qualify as a diabetic subject. A mouse is 

considered to be "normal" if it is neither diabetic nor 
hyperinsulinemic. Thus, normality is defined in a very 
limited manner. 

A mouse is considered "obese" if its weight is at least 

15 15% in excess of the mean weight for mice of its age and 
sex. A mouse which does not satisfy this standard may be 
characterized as "non-obese" , the term "normal" being 
reserved for use in reference to glucose and insulin levels 
as previously described. 

20 

A human is considered a diabetic subject if, regardless 
of his or her fasting plasma insulin level, the fasting 
plasma glucose level is at least 126 mg/dL. A human is 
considered a hyperinsulinemic subject if the fasting plasma 

25 insulin level is more than 2 6 micro International Units/mL 
(it is believed that this is equivalent to 1.08 ng/mL), and 
does not qualify as a diabetic subject. A human is 
considered to be "normal" if it is neither diabetic nor 
hyperinsulinemic. Thus, normality is defined in a very 

3 0 limited manner. 

A human is considered "obese" if the body mass index 
(BMI) (weight divided by height squared) is at least 3 0 
kg/m 2 . A human who does not satisfy this standard may be 
characterized as "non-obese", the term "normal" being 

35 reserved for use in reference to glucose and insulin levels 
as previously described. 

A human is considered overweight if the BMI is at least 
25 kg/m 2 . Thus, we define overweight to include obese 
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individuals, consistent with the recommendations of the 
National Institute of Diabetes and Digestive and Kidney- 
Diseases (NIDDK) . A human who does not satisfy this standard 
may be characterized as "non-overweight." 

5 

According to the Report of the Expert Commit te on the 
Diagnosis and Classification of Diabetes Mellitus, Diabetes 
Care 20: 1183-97 (1997), the following are risk factors for 
diabetes type II: 

10 

older (e.g., at least 45; see below) 

excessive weight (see below) 

15 first -degree relative with diabetes mellitus 

member of high risk ethnic group (black, Hispanic, 
Native American, Asian) 

2 0 history of gestational diabetes mellitus or delivering 

a baby weighing more than 9 pounds (4.032 kg) 

hypertensive (>14 0/90 mm Hg) 
25 HDL cholesterol level >35 mg/dL (0.90 mmol/L) 

triglyceride level >=250 mg/dL (2.83 mmol/L) 

Hence, in a preferred embodiment, the diagnostic and 

3 0 protective methods of the present invention are applied to 

human subjects exhibiting one or more of the aforementioned 
risk factors. Likewise, in a preferred embodiment, they are 
applied to human subjects who, while not diabetic, exhibit 
impaired glucose homeostasis (110 to <126 mg/dL) . 

35 

The risk of diabetes increases with age. Hence, in 
successive preferred embodiments, the age of the subjects is 
at least 45, at least 50, at least 55, at least 60, at least 
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65, at least 70, and at least 75. 

With regard to excessive weight, NIDDK says that "The 
relative risk of diabetes increases by approximately 25 
percent for each additional unit of BMI over 22." Hence, in 
5 successive preferred embodiments, the BMIs of the human 
subjects is at least 23, at least 24, at least 25 (i.e., 
overweight by our criterion), at least 26, at least 27, at 
least 28, at least 29, at least 30 (i.e., obese), at least 
31, at least 32, at least 33, at least 34, at least 35, at 
10 least 36, at least 37, at least 38, at least 39, at least 
40, or over 40. 

Genes/Proteins of Interest 

15 Favorable genes/proteins are those corresponding to 

genes less strongly expressed in hyperinsulinemic liver than 
in normal liver, or in type II diabetic liver as compared to 
hyperinsulinemic liver. Unfavorable genes/proteins are 
those corresponding to genes more strongly expressed in 

2 0 hyperinsulinemic liver than in normal liver, or in type II 
diabetic liver as compared to hyperinsulinemic liver. 

Mixed genes/proteins are those exhibiting a combination 
of favorable and unfavorable behavior. A mixed gene/protein 
can be used as would a favorable gene/protein if its 

25 favorable behavior outweighs the unfavorable. It can be 

used as would an unfavorable gene/protein if its unfavorable 
behavior outweighs the favorable. Preferably, they are used 
in conjunction with other agents that affect their balance 
of favorable and unfavorable behavior. Use of mixed 

30 genes/proteins is, in general, less desirable than use of 
purely favorable or purely unfavorable genes/proteins. 

For each of the differentially expressed genes, 
corresponding mouse and human proteins have been identified, 
35 as set forth in the Master Tables. 



Direct and Indirect Utility of Identified Nucleic Acid 
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Sequences and Related Molecules 

The mouse or human genes (or fragments thereof) may be 
used directly. For diagnostic or screening purposes, they 
(or specific binding fragments thereof) may be labeled and 
used as hybridization probes. For therapeutic purposes, 
they (or specific binding fragments thereof) may be used as 
antisense reagents to inhibit the expression of the 
corresponding gene, or of a sufficiently homologous gene of 
another species. 

Since each of the probes is representative of a full- 
length mouse gene, that is, it encodes an entire, functional 
protein, then it may be used in the expression of that 
protein. Likewise, if the corresponding human gene is known 
in full-length, it may be used to express the human protein. 
Such expression may be in cell culture, with the protein 
subsequently isolated and administered exogenously to 
subjects who would benefit therefrom, or in vivo, i.e., 
administration by gene therapy. Naturally, any DNA encoding 
the same protein, or a fragment or a mutant protein which 
retains the desired activity, may be used, for the same 
purpose. The encoded protein of course has utility 
therapeutically and, in labeled or immobilized form, 
diagnostically . 

The genes may also be used indirectly, that is, to 
identify other useful DNAs, proteins, or other molecules. 

There thus are several ways that a human protein 
homologue of interest can be identified by database 
searching, including: 

1) a DNA- >DNA (BlastN) search for database DNAs closely 
related to the mouse gene identifies a known human gene, and 
the sequence of the human protein is deduced by the Genetic 
Code ; 

2) a DNA->Protein (BlastX) search for database proteins 
closely related to the translated DNA of the mouse gene 
identifies a known human protein; and 
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3) the sequence of the mouse protein is known or is deduced 
by the Genetic Code, and a Protein- >Protein (BlastP) search 
for closely related database proteins identifies a known 
human protein. 

5 

Once a known human gene is identified, it may be used in 
further BlastN or BlastX searches to identify other human 
genes or proteins. Once a known human protein is 
identified, it may be used in further BlastP searches to 
10 identify other human proteins. 

Searches may also take cognizance, intermediately, of known 
genes and proteins other than mouse or human ones, e.g., use 
the mouse sequence to identify a known rat sequence and then 
15 the rat sequence to identify a human one. 

Thus, if we have identified a mouse gene, and it 
encodes a mouse protein which appears similar to a human 
protein, then that human protein may be used (especially in 
2 0 humans) for purposes analogous to the proposed use of the 
mouse protein in mice. Moreover, a specific binding 
fragment of an appropriate strand of the corresponding human 
■ gene or cDNA could be labeled and used as a hybridization 
probe (especially against samples of human mRNA or cDNA) . 

2 5 In determining whether the disclosed genes have 

significant similarities to known DNAs (and their translated 
AA sequences to known proteins) , one would generally use the 
disclosed gene as a query sequence in a search of a sequence 
database. The results of several such searches are set 
30 forth in the Examples. Such results are dependent, to some 
degree, on the search parameters. Preferred parameters are 
set forth in Example 1. The results are also dependent on 
the content of the database. While the raw similarity score 
of a particular target (database), sequence will not vary 

3 5 with content (as long as it remains in the database) , its 

informational value (in bits) , expected value, and relative 
ranking can change. Generally speaking, the changes are 
small . 
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It will be appreciated that the nucleic acid and 
protein databases keep growing. Hence a later search may- 
identify high scoring target sequences which were not 
5 uncovered by an earlier search because the target sequences 
were not previously part of a database . 

Hence, in a preferred embodiment, the cognate DNAs and 
proteins include not only those set forth in the examples, 
but those which would have been highly ranked (top ten, more 
10 . preferably top three, even more preferably top two, most 
preferably the top one) in a search run with the same 
parameters on the date of filing of this application. 

If the known human DNA is appears to be a partial DNA, 
15 it may be used as a hybridization probe to isolate the full- 
length DNA. If the partial DNA encodes a biologically 
functional fragment of the cognate protein, it may be used 
in a manner similar to the full length DNA, i.e., to produce 
the functional fragment. 

20 

If we have indicated that an antagonist of a protein or 
other molecule is useful, then such an antagonist may be 
obtained by preparing a combinatorial library, as described 
below, of potential antagonists, and screening the library 

25 members for binding to the protein or other molecule in 

question. The binding members may then be further screened 
for the ability to antagonize the biological activity of the 
target. The antagonists may be used therapeutically, or, in 
suitably labeled or immobilized form, diagnost ically . 

3 0 If the identified DNA is related to a known protein, 

then substances known to interact with that protein (e.g., 
agonists, antagonists, substrates, receptors, second 
messengers, regulators, and so forth), and binding molecules 
which bind them, are also of utility. Such binding 

35 molecules can likewise be identified by screening a 
combinatorial library. 

Isolation of Full Length cDNAs Using Partial cDNAs as probes 
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If it is determined that a DNA of the present invention 
is a partial DNA, and the cognate full length DNA is not 
listed in a sequence database, the available DNA may be used 
as a hybridization probe to isolate the full-length cDNA 
5 from a suitable cDNA library. 

Stringent hybridization conditions are appropriate, 
that is, conditions in which the hybridization temperature 
is 5-10 deg. C. below the Tm of the cDNA as a perfect 
duplex. 

10 

Identification and Isolation of Homologous Genes/ cDNAs Using 
a cDNA Probe 

It may be that the sequence databases available do not 
include the sequence of any homologous gene, or at least of 
15 the homologous gene for a species of interest. However, 

given the cDNAs set forth above, one may readily obtain the 
homologous gene. 

The possession of one DNA (the "starting DNA") greatly 
facilitates the isolation of homologous genes/cDNAs. If 

2 0 only a partial DNA is known, this partial DNA may first be 

used as a probe to isolate the corresponding . full length DNA 
for the same species, and that the latter may be used as the 
starting DNA in the search for homologous genes. 

The starting DNA, or a fragment thereof, is used as a 
25 hybridization probe to screen a cDNA or genomic DNA library 
for clones containing inserts which encode either the entire 
homologous protein, or a recognizable fragment thereof. The 
minimum length of the hybridization probe is dictated by the 
need for specificity. If the size of the library in bases 

3 0 is L, and the GC content is 50%, then the probe should have 

a length of at least 1, where L = 4 1 . This will yield, on 
average, a single perfect match in random DNA of L bases. 
The human cDNA library is about 10 s bases and the human 
genomic DNA library is about 10 10 bases. 
3 5 The library is preferably derived from an organism 

which is known, on biochemical evidence, to produce a 
homologous protein, and more preferably from the genomic DNA 
or mRNA of cells of that organism which are likely to be 
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relatively high producers of that protein. A cDNA library 
(which is derived from an mRNA library) is especially 
preferred . 

If the organism in question is known to have 
5 substantially different codon preferences from that of the 
organism whose relevant cDNA or genomic DNA is known, a 
synthetic hybridization probe may be used which encodes the 
same amino acid sequence but whose codon utilization is more 
similar to that of the DNA of the target organism. 

10 Alternatively, the synthetic probe may employ inosine as a 
substitute for those bases which are most likely to be 
divergent, or the probe may be a mixed probe which mixes the 
codons for the source DNA with the preferred codons 
(encoding the same amino acid) for the target organism. 

15 By routine methods, the Tm of a perfect duplex of 

starting DNA is determined. One may then select a 
hybridization temperature which is sufficiently lower than 
the perfect duplex Tm to allow hybridization of the starting 
DNA (or other probe) to a target DNA which is divergent from 

2 0 the starting DNA. A 1% sequence divergence typically lowers 
the Tm of a duplex by 1-2°C, and the DNAs encoding 
homologous proteins of different species typically have 
sequence identities of around 50-80%. Preferably, the 
library is screened under conditions where the temperature 

25 is at least 20°C, more preferably at least 50°C, below the 
perfect duplex Tm. Since salt reduces the Tm, one 
ordinarily would carry out the search for DNAs encoding 
highly homologous proteins under relatively low salt 
hybridization conditions, e.g., <1M NaCl . The higher the 

30 salt concentration, and/or the lower the temperature, the 
greater the sequence divergence which is tolerated. 

For the use of probes to identify homologous genes in 
other species, see, e.g., Schwinn, et al . , J. Biol. Chem. , 
265:8183-89 (1990) (hamster 67 -bp cDNA probe vs. human 

35 leukocyte genomic library; human 0.32kb DNA probe vs. bovine 
brain cDNA library, both with hybridization at 42 °C in 
6xSSC) ; Jenkins et al . , J. Biol. Chem., 265:19624-31 (1990) 
(Chicken 770-bp cDNA probe vs. human genomic libraries; 



WO 20(14/092416 



PCT/US2004/010I91 



23 

hybridization at 4 0°C in 50% formamide and 5xSSC) ; Murata et 
al . , J. Exp. Med., 175:341-51 (1992) (1.2-kb mouse cDNA 
probe v. human eosinophl cDNA library; hybridization at 65°C 
in 6xSSC) ; Guyer et al., J. Biol. Chem. , 265:17307-17 (1990) 
5 (2.95-kb human genomic DMA probe vs. porcine genomic DMA 

library; hybridization at 42 °C in 5xSSC) . The conditions 
set forth in these articles may each be considered suitable 
for the purpose of isolating homologous genes. 

10 Homologous Proteins and DNAs 

A human protein can be said to be identifiable as 
homologous to a mouse gene (and hence to "correspond" to 
such gene) if 

15 

(1) its sequence can be aligned to the mouse gene, using 
BlastX with the default parameters set forth below, and the 
expected value (E) of the alignment (the probability that 
such an alignment would have occurred by chance alone) is 

20 less than e-10, 

(2) its sequence can be aligned to a human gene, using 
BlastX with the default parameters set forth below, and the 
cDNA of said human gene can be aligned to the mouse gene, 

25 using BlastN with the default parameters set forth below, 
and the E value for both alignments is less than e-10, 

(3) its sequence can be aligned to a mouse protein, using 
BlastP with the default parameters set forth below, and that 

30 mouse protein can be aligned to the mouse gene, using BlastX 
with the default parameters set forth below, and in both 
alignments the E value of the alignment is less than e-10. 

Naturally, if the human protein is encoded by the human gene 
3 5 of (2) , or the mouse protein is encoded by the mouse gene of 
(3) , the BlastX alignment will be satisfied. 

Desirably, two or all three of these conditions (l)-(3) are 
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satisfied. 

Preferably, for any of the alignments noted above, and 
more preferably for all of them, the E value is less than e- 
5 15, more preferably less than e-2 0, still more preferably 
•less than e-40, even more preferably less than e-60, 
considerably more preferably less than e-80, and most 
preferably less than e-100. More preferably, for those 
conditions in which the mouse cDNA clone is indirectly 
10 connected to the human protein by virtue of two or more 

successive alignments, the E value is so limited for all of 
said alignments in the connecting chain. 

BlastN and BlastX report very low expected values as 
15 "0.0". This does not truly mean that the expected value is 
exactly zero (since any alignment could occur by chance) , 
but merely that it is so infinitesimal that it is not 
reported. The documentation does not state the cutoff 
value, alignments with explicit E values as low as e-178 
20 (624 bits) have been reported as such, while a score of 636 

bits was reported as "0.0". 

Functionally homologous human proteins are also of 
interest. A human protein may be said to be functionally 
25 homologous to the mouse gene if (l)it can be aligned to the 
mouse gene, using BlastX with the default parameters set 
forth below, and the E value of the alignment is less than 
e-50, and (2) the human protein has at least one biological 
activity in common with the mouse protein. 

30 

The human proteins of interest also include those that 
are substantially and/or conservatively identical (as 
defined below) to the homologous and/or functionally 
homologous human proteins defined above. 

35 
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Relevance of Favorable and Unfavorable Genes 

If a gene is down -regulated in more favored mammals, or 
up-regulated in less favored mammals, (i.e., an "unfavorable 
5 gene") then several utilities are apparent. 

First, the complementary strand of the gene, or a 
portion thereof, may be used in labeled form as a 
hybridization probe to detect messenger RNA and thereby 
monitor the level of expression of the gene in a subject. 

10 Elevated levels are indicative of progression, or 

propensity to progression, to a less favored state, and 
clinicians may take appropriate preventative, curative or 
ameliorative action. 

Secondly, the messenger RNA product (or equivalent 

15 cDNA) , the protein product, or a binding molecule specific 
for that product (e.g., an antibody which binds the 
product) , or a downstream product which mediates the 
activity (e.g., a signaling intermediate) or a binding 
molecule (e.g., an antibody) therefor, may be used, 

2 0 preferably in labeled or immobilized form, as an assay 

reagent in an assay for Said nucleic acid product, protein 
product, or downstream product (e.g., a signaling 
intermediate) . Again, elevated levels are indicative of a 
present or future problem. 

2 5 Thirdly, an agent which down- regulates expression of 

the gene may be used to reduce levels of the corresponding 
protein and thereby inhibit further damage. This agent 
could inhibit transcription of the gene in the subject, or 
translation of the corresponding messenger RNA. Possible 

3 0 inhibitors of transcription and translation include 

antisense molecules and repressor molecules. The agent 
could also inhibit a post-translational modification (e.g., 
glycosylation, phosphorylation, cleavage, GPI attachment) 
required for activity, or post-translationally modify the 
35 protein so as to inactivate it. Or it could be an agent 
which down- or up-regulated a positive or negative 
regulatory gene, respectively. 

Fourthly, an agent which is an antagonist of the 
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messenger RNA product or protein product of the gene, or of 
a downstream product through which its activity is 
manifested (e.g., a signaling intermediate), may be used to 
inhibit its activity. 
5 This antagonist could be an antibody, a peptide, a 

peptoid, a nucleic acid, a' peptide nucleic acid (PNA) 
oligomer, a small organic molecule of a kind for which a 
combinatorial library exists (e.g., a benzodiazepine), etc. 
An antagonist is simply a binding molecule which, by 

10 binding, reduces or abolishes the undesired activity of its 
target. The antagonist, if not an oligomeric molecule, is 
preferably less than 500 daltons. 

Fifthly, an agent which degrades, or abets the 
degradation of, that messenger RNA, its protein product or a 

15 downstream product which mediates its activity (e.g., a 

signaling intermediate) , may be used to curb the effective 
period of activity of the protein. 

If a gene is up-regulated in more favored mammals, or 
down -regulated in less favored animals then the utilities 

20 are converse to those stated above. 

First, the complementary strand of the gene, or a 
portion thereof, may be used in labeled form as a 
hybridization probe to detect messenger RNA and thereby 
monitor the level of expression of the gene in a subject. 

25 Depressed levels are indicative of damage, or possibly of a 
propensity to damage, and clinicians may take appropriate 
preventative, curative or ameliorative action. 

Secondly, the messenger RNA product, the equivalent 
cDNA, protein product, or a binding molecule specific for 

30 those products, or a downstream product, or a signaling 

intermediate, or a binding molecule therefor, may be used, 
preferably in labeled or immobilized form, as an assay 
reagent in an assay for said protein product or downstream 
product. Again, depressed levels are indicative of a 

3 5 present or future problem. 

Thirdly, an agent which up-regulates expression of the 
gene may be used to increase levels of the corresponding 
protein and thereby inhibit further progression to a less 
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favored state. By way of example, it could be a vector 
which carries a copy of the gene, but which expresses the 
gene at higher levels than does the endogenous expression 
system. Or it could be an agent which up- or down -regulates 
5 a positive or negative regulatory gene. 

Fourthly, an agent which is an agonist of the protein 
product of the gene, or of a downstream product through 
which its activity (of inhibition of progression to a less 
favored state) is manifested, or of a signaling intermediate 
10 may be used to foster its activity. 

Fifthly, an agent which inhibits the degradation of 
that protein product or of a downstream product or of a 
signaling intermediate may be used to increase the effective 
period of activity of the protein. 



Mutant Proteins 

The present invention also contemplates mutant proteins 
(peptides) which are substantially identical (as defined 

2 0 below) to the parental protein (peptide) . In general, the 
fewer the mutations, the more likely the mutant protein is 
to retain the activity of the parental protein. The effect 
of mutations is usually (but not always) additive. Certain 
individual mutations are more likely to be tolex~ated than 

2 5 others. 

A protein is more likely to tolerate a mutation which 

(a) is a substitution rather than an insertion or 
deletion; 

(b) is an insertion or deletion at the terminus, 
30 rather than internally, or, if internal, is at a 

domain boundary, or a loop or turn, rather than in 
an alpha helix or beta strand; 

(c) affects a surface residue rather than an 
interior residue; 

35 (d) affects a part of the molecule distal to the 

binding site; 

(e) is a substitution of one amino acid for 
another of similar size, charge, and/or 
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hydrophobicity , and does not destroy a disulfide 
bond or other crosslink; and '; 
(f) is at a site which is subject to substantial 
variation among a family of homologous proteins to 
5 which the protein of interest belongs. 

These considerations can be used to design functional 

mutants . 

Surface vs. Interior Residues 

10 Charged residues almost always lie on the surface of 

the protein. For uncharged residues, there is less 
certainty, but in general, hydrophilic residues are 
partitioned to the surface and hydrophobic residues to the 
interior. Of course, for a membrane protein, the membrane - 

15 spanning segments are likely to be rich in hydrophobic 
residues . 

Surface residues may be identified experimentally by 
various labeling techniques, or by 3-D structure mapping 
techniques like X-ray diffraction and NMR. A 3-D model of a 

2 0 homologous protein can be helpful. 

Binding Site Residues 

Residues forming the binding site may be identified by 
(1) comparing the effects of labeling the surface residues 
25 before and after complexing the protein to its target, (2) 
labeling the binding site directly with affinity ligands, 
(3) fragmenting the protein and testing the fragments for 
binding activity, and (4) systematic mutagenesis (e.g., 
alanine- scanning mutagenesis) to determine which mutants 

3 0 destroy binding. If the binding site of a homologous 

protein is known, the binding site may be postulated by 
analogy. 

Protein libraries may be constructed and screened that 
a large family (e.g., 10 s ) of related mutants may be 
3 5 evaluated simultaneously. 

Hence, the mutations are preferably conservative 
modifications as defined below. 
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"Substantially Identical" 

A mutant protein (peptide) is substantially identical 
to a reference protein (peptide) if (a) it ■ has at least 10% 
of a specific binding activity or a non-nutritional 
5 biological activity of the reference protein, and (b) is at 
least 50% identical in amino acid sequence to the reference 
protein (peptide) . It is "substantially structurally 
identical" if condition (b) applies, regardless of (a). 
Percentage amino acid identity is determined by 

10 aligning the mutant and reference sequences according to a 

rigorous dynamic programming algorithm which globally aligns 
their sequences to maximise their similarity, the similarity 
being scored as the sum of scores for each aligned pair 
according to an unbiased PAM2 50 matrix, and a penalty for 

15 each internal gap of -12 for the first null of the gap and - 
4 for each additional null of the same gap. The percentage 
identity is the number of matches expressed as a percentage 
of the adjusted (i.e., counting inserted nulls) length of 
the reference sequence . 

20 A mutant DNA sequence is substantially identical to a 

reference DNA sequence if they are structural sequences, and 
encoding mutant and reference proteins which are 
substantially identical as described above. 

If instead they are regulatory sequences, they are 

25 substantially identical if the mutant sequence has at least 
10% of the regulatory activity of the reference sequence, 
and is at least 50% identical in nucleotide sequence to the 
reference sequence. Percentage identity is determined as 
for proteins except that matches are scored +5, mismatches - 

3 0 4, the gap open penalty is -12, and the gap extension 
penalty (per additional null) is -4. 

Preferably, sequence which are substantially identical 
exceed the minimum identity of 50% e.g., are 51%, 66%, 75%, 
80%, 85%, 90%, 95% or 99% identical in sequence. 

35 DNA sequences may also be considered "substantially 

identical" if they hybridize to each other under stringent 
conditions, i.e., conditions at which the Tm of the 
heteroduplex of the one strand of the mutant DNA and the 
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more complementary strand of the reference DNA is not in 
excess of 10 °C. less than the Tm of the reference DNA 
homoduplex. Typically this will correspond to a percentage 
identity of 85-90%. 

5 

"Conservative Modifications" 

"Conservative modifications" are defined as 
(a) conservative substitutions of amino acids as 
hereafter defined; or 
10 (b) single or multiple insertions (extension) or 

deletions (truncation) of amino acids at the 
termini . 

Conservative modifications are preferred to other 
modifications. Conservative substitutions are preferred to 
15 other conservative modifications. 

"Semi-Conservative Modifications" are modifications 
which are not conservative, but which are (a) semi- 
conservative substitutions as hereafter defined; or (b) 
single or multiple insertions or deletions internally, but 

2 0 at interdomain boundaries, in loops or in other segments of 

relatively high mobility. Semi -conservative modifications 
are preferred to nonconservat ive modifications. Semi- 
conservative substitutions are preferred to other semi- 
conservative modifications. 
25 Non- conservative substitutions are preferred to other 

non- conservative modifications. 

The term "conservative" is used here in an a priori 
sense, i.e., modifications which would be expected to 
preserve 3D structure and activity, based on analysis of the 

3 0 naturally occurring families of homologous proteins and of 

past experience with the effects of deliberate- mutagenesis, 
rather than post facto , a modification already known to 
conserve activity. Of course, a modification which is 
conservative a priori may, and usually is, also conservative 
3 5 post facto . 

Preferably, except at the termini, no more than about 
five amino acids are inserted or deleted at a particular 
locus, and the modifications are outside regions known to 



WO 2004/092416 



PCT/US2004/0 10191 



31 

contain binding sites important to activity. 

Preferably, insertions or deletions are limited to the 
termini . 

A conservative substitution is a substitution of one 
amino acid for another of the same exchange group, the 
exchange groups being defined as follows 

I Gly, Pro, Ser, Ala (Cys) (and any nonbiogenic, 
neutral amino acid with a hydrophobicity not 
exceeding that of the aforementioned a.a.'s) 

II Arg, Lys, His (and any nonbiogenic, positively- 
charged amino acids) 

III Asp, Glu, Asn, Gin (and any nonbiogenic 
negatively-charged amino acids) 

IV Leu, lie, Met, Val (Cys) (and any nonbiogenic, 
aliphatic, neutral amino acid with a 
hydrophobicity too high for I above) 

V Phe, Trp, Tyr (and any nonbiogenic, aromatic 
neutral amino acid with a hydrophobicity too high 
for I above) . 

Note that Cys belongs to both I and IV. 

Residues Pro, Gly and Cys have special conformational 
roles. Cys participates in formation of disulfide bonds. 
Gly imparts flexibility to the chain. Pro imparts rigidity 
to the chain and disrupts a helices. These residues may be 
essential in certain regions of the polypeptide, but 
substitutable elsewhere. 

One, two or three conservative substitutions are more 
likely to be tolerated than a larger number. 

"Semi -conservative substitutions" are defined herein as 
being substitutions within supergroup I/II/III or within 
supergroup IV/V, but not within a single one of groups I-V. 
They also include replacement of any other amino acid with 
alanine. If a substitution is not conservative, it 
preferably is semi -conservative . 

"Non-conservative substitutions" are substitutions 
which are not "conservative" or " semi -conservative" . 

"Highly conservative substitutions" are a subset of 
conservative substitutions, and are exchanges of amino acids 
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within the groups Phe/Tyr/Trp, Met/Leu/lle/Val , His/Arg/Lys, 
Asp/Glu and Ser/Thr/Ala. They are more likely to be 
.tolerated than other conservative substitutions. Again, the 
smaller the number of substitutions, the more likely they 
5 are to be tolerated. 

"Conservatively Identical" 

A protein (peptide) is conservatively identical to a 
reference protein (peptide) it differs from the latter, if 
10 at all, solely by conservative modifications, the protein 
(peptide remaining at least seven amino acids long if the 
reference protein (peptide) was at least seven amino acids 
long.. 

A protein is at least semi -conservatively identical to 
15 a reference protein (peptide) if it differs from the latter, 
if at all, solely by semi -conservative or conservative 
modifications. 

A protein (peptide) is nearly conservatively identical 
to a reference protein (peptide) if it differs from the 
20 latter, if at all, solely by one or more conservative 

modifications and/or a single nonconservative substitution. 

It is highly conservatively identical if it differs, if 
at all, solely by highly conservative substitutions. Highly 
conservatively identical proteins are preferred to those 
25 merely conservatively identical . An absolutely identical 
protein is even more preferred. 



The core sequence of a reference protein (peptide) is 
30 the largest single fragment which retains at least 10% of a 
particular specific binding activity, if one is specified, 
or otherwise of at least one specific binding activity of 
the referent. If the referent has more than one specific 
binding activity, it may have more than one core sequence, 
35 and these may overlap or not . 

If it is taught that a peptide of the present invention 
may have a particular similarity relationship (e.g., 
markedly identical) to a reference protein (peptide) , 
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preferred peptides are those which comprise a sequence 
having that relationship to a core sequence of the reference 
protein (peptide) , but with internal insertions or deletions 
in either sequence excluded. Even more preferred peptides 
5 are those whose entire sequence has that relationship, with 
the same exclusion, to a core sequence of that reference 
protein (peptide) . 

10 Library 

The term "library" generally refers to a collection of 
chemical or biological entities which are related in origin, 
structure, and/or function, and which can be screened 
simultaneously for a property of interest . 

15 Libraries may be classified by how they are constructed 

(natural vs. artificial diversity; combinatorial vs. 
noncombinatorial) , how they are screened (hybridization, 
expression, display) , or by the nature of the screened 
library members (peptides, nucleic acids, etc.). 

2 0 In a "natural diversity" library, essentially all of 

the diversity arose without human intervention. This would 
be true, for example, of messenger RNA extracted from a non- 
engineered cell. 

In a "synthetic diversity" library, essentially all of 

2 5 the diversity arose deliberately as a result of human 
intervention. This would be true for example of a 
combinatorial library; note that a small level of natural 
diversity could still arise as a result of spontaneous 
mutation. It would also be true of a noncombinatorial 

30 library of compounds collected from diverse sources, even if 
they were all natural products. 

In a "non-natural diversity" library, at least some of 
the diversity arose deliberately through human intervention. 
In a "controlled origin" library, the source of the 

35 diversity is limited in some way. A limitation might be to 
cells of a particular individual, to a particular species, 
or to a particular genus, or, more complexly, to individuals 
of a particular species who are of a particular age, sex, 
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physical condition, geographical location, occupation and/or 
familial relationship. Alternatively or additionally, it 
might be to cells of a particular tissue or organ. Or it 
could be cells exposed to particular pharmacological, 
5 environmental, or pathogenic conditions. Or the library 

could be of chemicals, or a particular class of chemicals, 
produced by such cells. 

In a "controlled structure" library, the library 
members are deliberately limited by the production 
10 conditions to particular chemical structures. For- example, 
if they are oligomers, they may be limited in length and 
monomer composition, e.g. hexapep tides composed of the 
twenty genetically encoded amino acids. 

15 Hybridization Library 

In a hybridization library, the library members are 
nucleic acids, and are screened using a nucleic acid 
hybridization probe. Bound nucleic acids may then be 
amplified, cloned, and/or sequenced. 

20 

Expression Library 

In an expression library, the screened libx~ary members 
are gene expression products, but one may also speak of an 
underlying library of genes encoding those products. The 

2 5 library is made by subcloning DNA encoding the library 

members (or portions thereof) into expression vectors (or 
into cloning vectors which subsequently are used to 
construct expression vectors) , each vector comprising an 
expressible gene encoding a particular library member, 

3 0 introducing the expression vectors into suitable cells, and 

expressing the genes so the expression products are 
produced . 

In one embodiment, the expression products are 
secreted, so the library can be screened using an affinity 
3 5 reagent, such as an antibody or receptor. The bound 

expression products may be sequenced directly, or their 
sequences inferred by, e.g., sequencing at least the 
variable portion of the encoding DNA. 
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In a second embodiment, the cells are lysed, thereby 
exposing the expression products, and the latter are 
screened with the affinity reagent . 

In a third embodiment, the cells express the library 
5 members in such a manner that they are displayed on the 

surface of the cells, or on the surface of viral particles 
produced by the cells. (See display libraries, below) . 

In a fourth embodiment, the screening is not for the 
ability of the expression product to bind to an affinity 
10 reagent, but rather for its ability to alter the phenotype 
of the host cell in a particular detectable manner. Here, 
the screened library members are transformed cells, but 
there is a first underlying library of expression products 
which mediate the behavior of the cells, and a second 
15 underlying library of genes which encode those products. 

Display Library 

In a display library, the library members are each 
conjugated to, and displayed upon, a support of some kind. 

2 0 The support may be living (a cell or virus) , or nonliving 

(e.g., a bead or plate). 

If the support is a cell or virus, display will 
normally be effectuated by expressing a fusion protein which 
comprises the library member, a carrier moiety allowing 

25 integration of the fusion protein into the surface of the 
cell or virus, and optionally a lining moiety. In a 
variation on this theme, the cell coexpresses a first fusion 
comprising the library member and a linking moiety LI, and a 
second fusion comprising a linking moiety L2 and the carrier 

30 moiety. LI and L2 interact to associate the first fusion 
with the second fusion and hence, indirectly, the library 
member with the surface of the cell or virus. 

Soluble Library 

3 5 In a soluble library, the library members are free in 

solution. A soluble library may be produced directly, or 
one may first make a display library and then release the 
library members from their supports. 
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Encapsulated Library 

In an encapsulated library, the library members are 
inside cells or liposomes. Generally speaking, encapsulated 
libraries are used to store the library members for future 
5 use; the members are extracted in some way for screening 
purposes. However, if they differentially affect the. 
phenotype of the cells, they may be screened indirectly by 
screening the cells. 

10 cDNA Library 

A cDNA library is usually prepared by extracting RNA 
from cells of particular origin, fractionating the RNA to 
isolate the messenger RNA (mRNA has a poly (A) tail, so this 
is usually done by oligo-dT affinity chromatography) , 

15 synthesizing complementary DNA (cDNA) using reverse 

transcriptase, DNA polymerase, and other enzymes, subcloning 
the cDNA into vectors, and introducing the vectors into 
cells. , Often, only mRNAs or cDNAs of particular sizes will 
be used, to make it more likely that the cDNA encodes a 

2 0 functional polypeptide.' 

A cDNA library explores the natural diversity of the 
transcribed DNAs of cells from a particular source. It is 
not a combinatorial library. 

A cDNA library may be used to make a hybridization 

2 5 library, or it may be used as an (or to make) expression 

library. 

Genomic DNA Library 

A genomic DNA library is made by extracting DNA from a 

3 0 particular source, fragmenting the DNA, isolating fragments 

of a particular size range, subcloning the DNA fragments 
into vectors, and introducing the vectors into cells. 

Like a cDNA library, a genomic DNA library is a natural 
diversity library, and not a combinatorial library. A 
3 5 genomic DNA library may be used the same way as a cDNA 
library. 

Synthetic DNA library 
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A synthetic DNA library may be screened directly (as a 
hybridization library) , or used in the creation of an 
expression or display library of pept ides/proteins . 

5 Combinatorial Libraries 

The term "combinatorial library" refers to a library in 
which the individual members are either systematic or random 
combinations of a limited set of basic elements, the 
properties of each member being dependent on the choice and 

10 location of the elements incorporated into it. Typically, 
the members of the library are at least capable of being 
screened simultaneously. Randomization may be complete or 
partial; some positions may be randomized and others 
predetermined, and at random positions, the choices may be 

15 limited in a predetermined manner. The members of a 

combinatorial library may be oligomers or polymers of some 
kind, in which the variation occurs through the choice of 
monomeric building block at one or more positions of the 
oligomer or polymer, and possibly in terms of the connecting 

2 0 linkage, or the length of the oligomer or polymer, too. Or 
the members may be honoligomeric molecules with a standard 
core structure, like the 1 , 4 -benzodiazepine structure, with 
the variation being introduced by the choice of substituents 
at particular variable sites on the core structure. Or the 

2 5 members may be nonoligomeric molecules assembled like a 

jigsaw puzzle, but wherein each piece has both one or more 
variable moieties (contributing to library diversity) and 
one or more constant moieties (providing the functionalities 
for coupling the piece in question to other pieces) . 

3 0 Thus, in a typical combinatorial library, chemical 

building blocks are at least partially randomly combined 
into a large number (as high as 10 15 ) of different compounds, 
which are then simultaneously screened for binding (or 
other) activity against one or more targets. 
35 In a "simple combinatorial library", all of the members 

belong to the same class of compounds (e.g., peptides) and 
can be synthesized simultaneously. A "composite 
combinatorial library" is a mixture of two or more simple 
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libraries, e.g., DNAs and peptides, or peptides, peptoids, 
- and PNAs , or benzodiazepines and carbamates. The number of 
component simple libraries in a composite library will, of 
course, normally be smaller than the average number of 
5 members in each simple library, as otherwise the advantage 
of a library over individual synthesis is small . 

Libraries of thousands, even millions, of random 
oligopeptides have' been prepared by chemical synthesis 
(Houghten et al . , Nature, 354:84-6(1991)), or gene 

10 expression (Marks et al . , J Mol Biol, 222:581-97(1991)), 

- displayed on chromatographic supports (Lam et al . , Nature, 
354:82-4(1991)), inside bacterial cells (Colas et al . , 
Nature, 380:548-550(1996)), on bacterial pili (Lu, 
Bio /Technology, 13:366-372(1990)), or phage (Smith, Science, 

15 228:1315-7(1985)), and screened for binding to a variety of 
targets including antibodies (Valadon et al . , J Mol Biol, 
261:11-22(1996)), cellular proteins (Schmitz et al., J Mol 
Biol, 260:664-677(1996)), viral proteins (Hong and 
Boulanger, Embo J, 14:4714-4727(1995)), bacterial proteins 

20 (Jacobsson and Frykberg, Biotechniques , 18:878-885(1995)), 

nucleic acids (Cheng et al . , Gene., 171:1-8(1996)), and 
plastic (Siani et al . , J Chem Inf Comput Sci, 34:588- 
593 (1994) ) . 

Libraries of proteins (Ladner, USP 4,664,989), peptoids 
25 (Simon et al . , Proc Natl Acad Sci USA, 89:9367-71(1992)), 

nucleic acids (Ellington and Szostak, Nature, 
246:818(1990)), carbohydrates, and small organic molecules 
(Eichler et al . , Med Res Rev, 15:481-96(1995)) have also 
been prepared or suggested for drug screening purposes . 
3 0 The first combinatorial libraries were composed of 

peptides or proteins, in which all or selected amino acid 
positions were randomized. Peptides and proteins can exhibit 
high and specific binding activity, and can act as 
catalysts. In consequence, they are of great importance in 
35 biological systems. 

Nucleic acids have also been used in combinatorial 
libraries . Their great advantage is the ease with which a 
nucleic acid with appropriate binding activity can be 
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amplified. As a result, combinatorial libraries composed of 
nucleic acids can be of low redundancy and hence, of high 
diversity . 

There has also been much interest in combinatorial 
5 libraries based on small molecules, which are more suited to 
pharmaceutical use, especially those which, like 
benzodiazepines, belong to a chemical class which has 
already yielded useful pharmacological agents. The 
techniques of combinatorial chemistry have been recognized 

10 as the most efficient means for finding small molecules that 
act on these targets. At present, small molecule 
combinatorial chemistry involves the synthesis of either 
pooled or discrete molecules that present varying arrays of 
functionality on a common scaffold. These compounds are 

15 grouped in libraries that are then screened against the 

target of interest either for binding or for inhibition of 
biological activity. 

The size of a library is the number of molecules in it. 
The simple diversity of a library is the number of unique 

2 0 structures in it. There is no formal minimum or maximum 
diversity. If the library has a very low diversity, the 
library has little advantage over just synthesizing and 
screening the members individually. If the library is of 
very high diversity, it may be inconvenient to handle, at 

25 least without automatizing the process. The simple 

diversity of a library is preferably at least 10, 10E2, 
10E3, 10E4, 10E6, 10E7, 10E8 or 10E9, the higher the better 
under most circumstances. The simple diversity is usually 
not more than 10E15, and more usually not more than 10E10. 

30 The average sampling level is the size divided by the 

simple diversity. The expected average sampling level must 
be high enough to provide a reasonable assurance that, if a 
given structure were expected, as a consequence of the 
library design, to be present, that the actual average 

35 sampling level will be high enough so that the structure, if 
satisfying the screening criteria, will yield a positive 
result when the library is screened. Thus, the preferred 
average sampling level is a function of the detection limit, 
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. which in turn is a function of the strength of the signal to 
be screened . 

There are more complex measures of diversity than 
simple diversity. These attempt to take into account the 
5 degree of structural difference between the various unique 
sequences. These more complex measures are usually used in 
the context of small organic compound libraries, see below. 

The library members may be presented as solutes in 
solution, or immobilized on some form of support. In the 

10 latter case, the support may be living (cell, virus) or 

nonliving (bead, plate, etc.) . The supports may be separable 
(cells, virus particles, beads) so that binding and 
nonbinding members can be separated, or nonseparable 
(plate) . In the latter case, the members will normally be 

15 placed on addressable positions on the support. The 

advantage of a soluble library is that there is no carrier 
moiety that could interfere with the binding of the members 
to the support. The advantage of an immobilized library is 
that it is easier to identify the structure of the members 

20 which were positive. 

When screening a soluble library, or one with a 
separable support, the target is usually immobilized. When 
screening a library on a nonseparable support, the target 
will usually be labeled. 

25 

Oligonucleotide Libraries 

An oligonucleotide library is a combinatorial library, 
at least some of whose members are single -stranded 
oligonucleotides having three or more nucleotides connected 

30 by phosphodiester or analogous bonds. The oligonucleotides 
may be linear, cyclic or branched, and may include non- 
nucleic acid moieties. The nucleotides are not limited to 
the nucleotides normally found in DMA or RNA. For examples 
of nucleotides modified to increase nuclease resistance and 

3 5 chemical stability of aptamers, see Chart 1 in Osborne and 
Ellington, Chem. Rev., 97: 349-70 (1997). For screening of 
RNA, see Ellington and Szostak, Nature, 346: 818-22 (1990) . 
There is no formal minimum or maximum size for these 
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oligonucleotides. However, the number of conformations which 
an oligonucleotide „ can assume increases exponentially with 
its length in bases. Hence, a longer oligonucleotide is 
more likely to be able to fold to adapt itself to a protein 
surface. On the other hand, while very long molecules can 
be synthesized and screened, unless they provide a much 
superior affinity to that of shorter molecules, they are not 
likely to be found in the selected population, for the 
reasons explained by Osborne and Ellington (1997) . Hence, 
the libraries of the present invention are preferably 
composed of oligonucleotides having a length of 3 to 100 
bases, more preferably 15 to 3 5 bases. The oligonucleotides 
in a given library may be of the same or of different 
lengths . 

Oligonucleotide libraries have the advantage that 
libraries of very high diversity (e.g., 10 15 ) are feasible, 
and binding molecules are readily amplified in vitro by 
polymerase chain reaction (PCR) . Moreover, nucleic acid 
molecules can have very high specificity and affinity to 
targets . 

In a preferred embodiment, this invention prepares and 
screens oligonucleotide libraries by the SELEX method, as 
described in King and Famulok, Molec . Biol. Repts., 20: 97- 
107 (1994); L. Gold, C. Tuerk. Methods of producing nucleic 
acid ligands, US#5595877; Oliphant et al . Gene 44:177 
(1986) . 

The term "aptamer" is conferred on those 
oligonucleotides which bind the target protein. Such 
aptamers may be used to characterize the target protein, 
both directly (through identification of the aptamer and the 
points of contact between the aptamer and the protein) and 
indirectly (by use of the aptamer as a ligand to modify the 
chemical reactivity of the protein) . 

In a classic oligonuclotide , each nucleotide (monomeric 
unit) is composed of a phosphate group, a sugar moiety, and 
either a purine or a pyrimidine base. In DNA, the sugar is 
deoxyribose and in RNA it is ribose . The nucleotides are 
linked by 5' -3' phosphodiester bonds. 
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The deoxyribose phosphate backbone , of DNA can be 
modified to increase resistance to nuclease and to increase 
penetration of cell membranes. Derivatives such as mono- or 
dithiophosphates , methyl phosphonates , boranophosphates , 
5 formacetals, carbamates, siloxanes, and dimethyl enethio- - 
sulfoxideo- and-sulfono- linked species are known in the 
art . 

Peptide Library 

10 A peptide is composed of a plurality of amino acid 

residues joined together by peptidyl (-NHCO-) bonds. A 
biogenic peptide is a peptide in which the residues are all 
genetically encoded amino acid residues; it is not necessary 
\that the biogenic peptide actually be produced by gene 

15 expression. 

Amino acids are the basic building blocks with which 
peptides and proteins are constructed. Amino acids possess 
both an amino group ( - NH 2 ) and a carboxylic acid group (- 
COOH) . Many amino acids, but not all, have the alpha amino 

2 0 acid structure NH 2 -CHR-COOH, where R is hydrogen, or any of a 

variety of functional . groups . 

Twenty amino acids are genetically encoded: Alanine, 
Arginine, Asparagine, Aspartic Acid, Cysteine, Glutamic 
Acid, Glutamine, Glycine, Histidine, Isoleucine, Leucine, 
25 Lysine, . Methionine, Phenylalanine, Proline, Serine, 

Threonine, Tryptophan, Tyrosine, and Valine. Of these, all 
save Glycine are optically isomeric, however, only the L- 
form is found in humans. Nevertheless, the D- forms of these 
amino acids do have biological significance; D-Phe, for 
30 example, is a known analgesic. 

Many other amino acids are also known, including: 2- 
Aminoadipic acid; 3 -Aminoadipic acid; beta-Aminopropionic 
acid; 2-Aminobutyric acid; 4 -Aminobutyric acid (Piperidinic 
acid) ; 6-Aminocaproic acid; 2 -Aminoheptanoic acid; 2- 

3 5 Aminoisobutyric acid, 3 -Aminoisobutyric acid; 2 -Aminopimelic 

acid; 2 , 4 -Diaminobutyric acid; Desmosine; 2,2'- 
Diaminopimelic acid; 2 , 3 -Diaminopropionic acid; N- 
Ethylglycine ; N-Ethylasparagine ; Hydroxylysine ; alio- 



WO 2(104/092416 



PCT/US2004/01019I 



Hydroxylysine ; 3 -Hydroxyproline ,- 4 -Hydroxyproline ; 
I s ode smo sine ; alio- Isoleucine ; N- Methyl glycine (Sarcosine) ; 
N-Methylisoleucine ; N- Methyl valine ; Norvaline; Norleucine; 
and Ornithine . 

5 Peptides are constructed by condensation of amino acids 

and/or smaller peptides. The amino group of one amino acid 
(or peptide) reacts with the carboxylic acid group of a 
second amino acid (or peptide) to form a peptide (-NHC0-) 
bond, releasing one molecule of water. Therefore, when an 

10 amino acid is incorporated into a peptide, it should, 
technically speaking, be referred to as an amino acid 
residue . The core of that residue is the moiety which 
excludes the -NH and -CO linking functionalities which 
connect it to other residues. This moiety consists of one 

15 or more main chain atoms (see below) and the attached side 
chains . 

The main chain moiety of each amino acid consists of 
the -NH and -CO linking functionalities and a core main 
chain moiety. Usually the latter is a single carbon atom. 

2 0 However, the core main chain moiety may include additional 

carbon atoms, and may also include nitrogen, oxygen or 
sulfur atoms, which together form a single chain. In a 
preferred embodiment, the core main chain atoms consist 
solely of carbon atoms . 
25 The side chains are attached to the core main chain 

atoms. For alpha amino acids, in which the side chain is 
attached to the alpha carbon, the C-l, C-2 and N-2 of each 
residue form the repeating unit of the main chain, and the 
word "side chain" refers to the C-3 and higher numbered 

3 0 carbon atoms and their subst ituents . It also includes H 

atoms attached to the main chain atoms . 

Amino acids may be classified according to the number 
of carbon atoms which appear in the main chain between the 
carbonyl carbon and amino nitrogen atoms which participate 
35 in the peptide bonds. Among the 150 or so amino acids which 
occur in nature, alpha, beta, gamma and delta amino acids 
are known. These have 1-4 intermediary carbons. Only alpha 
amino acids occur in proteins. Proline is a special case of 
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an alpha amino acid; its side chain also binds to the 
peptide bond nitrogen. 

For beta and higher order amino acids, there is a 
choice as to which main chain core carbon a side chain other 
5 than H is attached to. The preferred attachment site is the 
C-2 (alpha) carbon, i.e., the one adjacent to the carboxyl 
carbon of the -CO linking functionality. It is also possible 
for more than one. main chain atom to carry a side chain 
other than H. However, in a preferred embodiment, only one 
10 main chain core atom carries a side chain other than H. 

A main chain carbon atom may carry either one or two 
side chains; one is more common. A side chain may be 
attached to a main chain carbon atom by a single or a double 
bond; the former is more common. 
15 A simple combinatorial peptide library is one whose 

members are peptides having three or more amino acids 
connected via peptide bonds. 

The peptides may be linear, branched, or cyclic, and 
may covalently or noncovalently include nonpeptidyl 
20 moieties- The amino acids are not limited to the naturally 
occurring or to the genetically encoded amino acids. 

A biased peptide library is one in which one or more 
(but not all) residues of the peptides are constant 
residues . 

25 

Cyclic Peptides 

Many naturally occurring peptides are cyclic. 
Cyclization is a common mechanism for stabilization of 
peptide conformation thereby achieving improved association 

3 0 of the peptide with its ligand and hence improved biological 
activity. Cyclization is usually achieved by intra-chain 
cystine formation, by formation of peptide bond between side 
chains or between N- and C- terminals. Cyclization was 
usually achieved by peptides in solution, but several 

3 5 publications have appeared that describe cyclization of 
peptides on beads. 

A peptide library may be an oligopeptide library or a 
protein library. 
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Oligopeptides 

Preferably, the oligopeptides are at least five, six, 
seven or eight amino acids in length. Preferably, they are 
composed of less than 50, more preferably less than 20 amino 
5 acids. 

In the case of an oligopeptide library, all or just 
some of the residues may be variable. The oligopeptide may 
be unconstrained, or constrained to a particular 
conformation by, e.g., the participation of constant 
10 cysteine residues in the formation of a constraining 
disulfide bond. 

Proteins 

Proteins, like oligopeptides, are composed of a 
15 plurality of amino acids, but the term protein is usually 

reserved for longer peptides, which are able to fold into a 
stable conformation. A protein may be composed of two or 
more polypeptide chains, held together by covalent or 
noncovalent crosslinks. These may occur in a homooligomeric 

2 0 or a heterooligomeric state. 

A peptide is considered a protein if it (1) is at least. 
50 amino acids long, or (2) has at least two stabilizing 
covalent crosslinks (e.g., disulfide bonds). Thus, 
conotoxins are considered proteins . 
25 Usually, the proteins of a protein library will be 

characterizable as having both constant residues (the same 
for all proteins in the library) and variable residues 
(which vary from member to member) . This is simply because, 
for a given range of variation at each position, the 

3 0 sequence space (simple diversity) grows exponentially with 

the number of residue positions, so at some point it becomes 
inconvenient for all residues of a peptide to be variable 
positions . Since proteins are usually larger than 
oligopeptides, it is more common for protein libraries than • 
35 oligopeptide libraries to feature variable positions. 

In the case of a protein library, it is desirable to 
focus the mutations at those sites which are tolerant of 
mutation. These may be determined by alanine scanning 
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mutagenesis or by comparison of the protein sequence to that 
of homologous proteins of similar activity. It is also more 
likely that mutation of surface residues will directly 
affect binding. Surface residues may be determined by 
5 inspecting a 3D structure of the protein, or by labeling the 
surface and then ascertaining which residues have received 
labels. They may also be inferred by identifying regions of 
high hydrophilicity within the protein. 

Because proteins are often altered at some sites but 

10 not others, protein libraries can be considered a special 
case of the biased peptide library. 

There are several reasons that one might screen a 
protein library instead of an oligopeptide library, 
including (1) a particular protein, mutated in the library, 

15 has the desired activity to some degree already, and (2) the 
oligopeptides are not expected to have a sufficiently high 
affinity or specificity since they do not have a stable 
conf ormat ion . 

When the protein library is based on a parental protein 

2 0 which does not have the desired activity, the parental 
protein will usually be one which is of high stability 
(melting point >= 50 deg. C.) and/or possessed of 
hypei-variable regions . 

The variable domains of an antibody possess 

25 hypervariable regions and hence, in some embodiments, the 

protein library comprises members which comprise a mutant of 
VH or VL chain, or a mutant of an antigen- specif ic binding 
fragment of such a chain. VH and VL chains are usually each 
about 110 amino acid residues, and are held. in proximity by 

30 a disulfide bond between the adjoing CL and CHI regions to 
form a variable domain. Together, the VH, VL, CL and CHI 
form an Fab fragment . 

In human heavy chains, the hypervariable regions are at 
31-35, 49-65, 98-111 and 84-88, but only the first three are 

35 involved in antigen binding. There is variation among VH 

and VL chains at residues outside the hypervariable regions, 
but to a much lesser degree. 

A sequence is considered a mutant of a VH or VL chain 
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if it is at least 80% identical to a naturally occurring VH 
or VL chain at all residues outside the hypervariable 
region . 

In a preferred embodiment, such antibody library 
5 members comprise both at least one VH chain and at least one 
VL chain, at least one of which is a mutant chain, and which 
chains may be derived from the same or different antibodies. 
The VH and VL chains may be covalently joined by a suitable 
linker moiety, as in a "single chain antibody", or they may 
10 be noncovalently joined, as in a naturally occurring 
variable domain. 

If the joining is noncovalent, and the library is 
displayed on cells or virus, then either the VH or the VL 
chain may be fused to the carrier surface/coat protein. The 
15 complementary chain may be co-expressed, or added 
exogenously to the library. 

The members may further comprise some or all of an 
antibody constant heavy and/or constant light chain, or a 
mutant thereof . 

20 

Peptoid Library 

A peptoid is an analogue of a peptide in which one or 
more of the peptide bonds (-NH-CO-) are replaced by 
pseudopeptide bonds, which may be the same or different. It 
25 is not necessary that all of the peptide bonds be replaced, 
i.e., a peptoid may include one or more conventional amino 
acid residues, e.g., proline. 

A peptide bond has two small divalent linker elements, 
-NH- and -CO- . Thus, a preferred class of psuedopeptide 
3 0 bonds are those which consist of two small divalent linker 
elements. Each may be chosen independently from the group 
consisting of amine (-NH-) , substituted amine (-NR-) , 
carbonyl (-CO-), thiocarbonyl ( -CS- ), methylene (-CH2-), 
monosubstituted methylene (-CHR-) , disubsti tuted methylene 
35 (-CR1R2-), ether (-O-) and thioether (-S-). The more 

preferred pseudopeptide bonds include: 
N-modified -NRCO- 
Carba W -CH 2 -CH 2 - 
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Depsi W -CO-O- 
Hydroxyethylene W -CHOH-CH 2 - 
Ketomethylene ¥ -CO-CH 2 - 
Methylene -Oxy -CH 2 -0- 
Reduced -CH 2 -NH- 
Thiomethylene -CH 2 -S- 
Thiopeptide -CS-NH- 
Retro-Inverso -CO-NH- 

A single peptoid molecule may include more than one 
kind of pseudopeptide bond. 

For the purposes of introducing diversity into a 
peptoid library, one may vary (1) the side chains attached 
to the core main chain atoms of the monomers linked by the 
pseudopeptide bonds, and/or (2) the side chains (e.g., the - 
R of an -NRCO-) of the pseudopeptide bonds. Thus, in one 
embodiment, the monomeric units which are not amino acid 
residues are of the structure -NR1-CR2 -CO- , where at least 
one of Rl and R2 are not hydrogen. If there is variability 
in the pseudopeptide bond, this is most conveniently done by 
using an -NRCO- or other pseudopeptide bond with an R group, 
and varying the R group. In this event, the R group will 
usually be any of the side chains' characterizing the amino 
acids of peptides, as previously discussed. 

If the R group of the pseudopeptide bond is not 
variable, it will usually be small, e.g., not more than 10 
atoms (e.g., hydroxyl , amino, carboxyl , methyl, ethyl, 
propyl) . 

If the conjugation chemistries are compatible, a simple 
combinatorial library may include both peptides and 
peptoids . 

Peptide Nucleic Acid Library 

A PNA oligomer is here defined as one comprising a 
plurality of units, at least one of which is a PNA monomer 
which comprises a side chain comprising a nucleobase. For 
nucleobases, see USP 6,077,835. 

The classic PNA oligomer is composed of (2- 
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aminoethyl) glycine units, with nucleobases attached by 
methylene carbonyl linkers. That is, it has the structure 

H- (-HN-CH 2 -CH 2 -N(-CO-CH 2 -B) -CH 2 -CO-) n -OH 

where the outer parenthesized substructure is the PNA 
monomer . 

In this structure, the nucleobase B is separated from 
the backbone N by three bonds, and the points of attachment 
of the side chains are separated by six bonds. The 
nucleobase may be any of the bases included in the 
nucleotides discussed in connection with oligonucleotide 
libraries. The bases of nucleotides A, G, T, C and U are 
preferred. 

A PNA oligomer may further comprise one or more amino 
acid residues, especially glycine and proline. 

One can readily envision related molecules in which (1) 
the -COCH2- linker is replaced by another linker, especially 
one composed of two small divalent linkers as defined 
previously, (2) a side chain is attached to one of the three 
main chain carbons not participating in the peptide bond 
(either instead or in addition to the side chain attached to 
the N of the classic PNA); and/or (3) the peptide bonds are 
replaced by pseudopept ide bonds as disclosed previously in 
the context of peptoids . 

PNA oligomer libraries have been made; see e.g. Cook, 
6,204,326. 

Small Organic Compound Library 

The small organic compound library ("compound library", 
for short) is a combinatorial library whose members are 
suitable for use as drugs if, indeed, they have the ability 
to mediate a biological activity of the target protein. 

Peptides have certain disadvantages as drugs. These 
include susceptibility to degradation by serum proteases, 
and difficulty in penetrating cell membranes. Preferably, 
all or most of the compounds of the compound library avoid, 



WO 2004/092416 



PCT/US2004/010191 



50 

or at least do not suffer to the same degree, one or more of 
the pharmaceutical disadvantages of peptides. 

In designing a compound library, it is helpful to bear 
in mind the methods of molecular modification typically used 
to obtain new drugs. Three basic kinds of modification may 
be identified: disjunction , in which a lead drug is 
simplified to identify its component pharmacophoric 
moieties; conjunction , in which two or more known 
pharmacophoric moieties, which may be the same or different, 
are associated, covalently or noncovalently, to form a new 
drug; and alteration , in which one moiety is replaced by 
another which may be similar or different, but which is not 
in effect a disjunction or conjunction. The use of the 
terms "disjunction", "conjunction" and "alteration" is 
intended only to connote the structural relationship of the 
end product to the original leads, and not how the new drugs 
are actually synthesized, although it is possible that the 
two are the same . 

The process of disjunction is illustrated by the 
evolution of neostigmine (1931) and edrophonium (1952) from 
physostigmine (1925) . Subsequent conjunction is illustrated 
by demecarium (19 56) and ambenonium (1956) . 

Alterations may modify the size, polarity, or electron 
distribution of an original moiety. Alterations include 
ring closing or opening, formation of lower or higher 
homologues, introduction or saturation of double bonds, 
introduction of optically active centers, introduction, 
removal or replacement of bulky groups, isosteric or 
bioisosteric substitution, changes in the position or 
orientation of a group, introduction of alkylating groups, 
and introduction, removal or replacement of groups with a 
view toward inhibiting or promoting inductive 
(electrostatic) or conjugative (resonance) effects. 

Thus, the substituents may include electron acceptors 
and/or electron donors. Typical electron donors (+1) 
include -CH 3 , -CH 2 R, -CHR 2 , -CR 3 and -COO". Typical electron 
acceptors (-1) include -NH 3 +, -NR 3 +, -NO,, -CN, -COOH, -COOR, 
-CHO, -COR, -COR, -F, -CI, -Br, -OH, -OR, -SH, -SR, -CH=CH 2 , 
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-CR=CR 2 , and -C=CH. 

The substituents may also include those which increase 
or decrease electronic density in conjugated systems. The 
former (+R) groups include -CH 3 , -CR 3 , -F, -CI, -Br, -I, -OH, 
-OR, -OCOR, -SH, -SR, -NH 2 , -NR 2 , and -NHCOR. The later (-R) 
groups include -N0 2 , -CN, -CHC, -COR, -COOH, -COOR, -CONH 2 , 
-S0 2 R and -CF 3 . 

Synthetically speaking, the modifications may be 
achieved by a variety of unit processes, including 
nucleophilic and electrophilic substitution, reduction and 
oxidation, addition elimination, double bond cleavage, and 
cyclization . 

For the purpose of constructing a library, a compound, 
or a family of compounds, having one or more pharmacological 
activities (which need not be related to the known or 
suspected activities of the target protein) , may be 
disjoined into two or more known or potential pharmacophoric 
moieties. Analogues of each of these moieties may be 
identified, and mixtures of these analogues reacted so as to 
reassemble compounds which have some similarity to the 
original lead compound. It is not necessary that all 
members of the library possess moieties analogous to all of 
the moieties of the lead compound. 

The design of a library may be illustrated by the 
example of the benzodiazepines. Several benzodiazepine 
drugs, including chlordiazepoxide , diazepam and oxazepam, 
have been used as anti-anxiety drugs. Derivatives of 
benzodiazepines have widespread biological activities ; 
derivatives have been reported to act not only as 
anxiolytics, but also as anticonvulsants; cholecystokinin 
(CCK) receptor subtype A or B, kappa opioid receptor, 
platelet activating factor, and HIV transactivator Tat 
antagonists, and GPIIblla, reverse transcriptase and ras 
farnesyltransf erase inhibitors. 

The benzodiazepine structure has been disjoined into a 
2-aminobenzophenone, an amino acid, and an alkylating agent. 
See Bunin, et al . , Proc . Nat. Acad. Sci. USA, 91:4708 
(1994) - Since only a few 2-aminobenzophenone derivatives 
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are commercially available, it was later disjoined into 2- 
aminoarylstannane , an acid chloride, an amino acid, and an 
alkylating agent. Bunin, et al . , Meth. Enzymol . , 267:448 
(1996) . The arylstannane may be considered the core 
5 structure upon which the other moieties are substituted, or 
all four may be considered equals which are conjoined to 
make each library member. 

A basic' library synthesis plan and member structure is 
shown in Figure 1 of Fowlkes, et al . , U.S. Serial No. 

10 08/740,671, incorporated by reference in its entirety. The 

acid chloride building block introduces variability at the R 1 
site. The R 2 site is introduced by the amino acid, and the 
R 3 site by the alkylating agent. The R 4 site is inherent in 
the arylstannane. Bunin, et al . generated a 1, 4- 

15 benzodiazepine library of 11,20 0 different derivatives 
prepared from 20 acid chlorides, 35 amino acids, and 16 
alkylating agents. (No diversity was introduced at R 4 ,- this 
group was used to couple the molecule to a solid phase.) 
According to the Available Chemicals Directory {HDL 

2 0 Information Systems, San Leandro CA) , over 3 00 acid 

chlorides, 80 Fmoc -protected amino acids and 800 alkylating 
agents were available for purchase (and more, of course, 
could be synthesized) . The particular moieties used were 
chosen to maximize structural dispersion, while limiting the 
25 numbers to those conveniently synthesized in the wells of a 
microtiter plate. In choosing between structurally similar 
compounds, preference was given to the least substituted 
compound . 

The variable elements included both aliphatic and 
30 aromatic groups. Among the aliphatic groups, both acyclic 
and cyclic (mono- or poly-) structures, substituted or not, 
were tested. (While all of the acyclic groups were linear, 
it would have been feasible to introduce a branched 
aliphatic) . The aromatic groups featured either single and 

3 5 multiple rings, fused or not, substituted or not, and with 

heteroatoms or not. The secondary substitutents included - 
NH 2 , -OH, -OMe, -CN, -CI, -F, and -COOH. While not used, 
spacer moieties, such as -O-, -S-, -OO- , -CS-, -NH-, and - 
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NR- , could have been incorporated. 

Bunin et al . suggest that instead of using a 1, 4- 
benzodiazepine as a core structure, one may instead use a 1, 
4 -benzodiazepine-2 , 5-dione structure. 
5 As noted by Bunin et al . , it is advantageous, although 

not necessary, to use a linkage strategy which leaves no 
trace of the linking functionality, as this permits 
construction of a more diverse library. 

Other combinatorial nonoligomeric compound libraries 
10 known or suggested in the art have been based on carbamates, 
mercaptoacylated pyrrolidines, phenolic agents, aminimides, 
N-acylamino ethers (made from amino alcohols, aromatic 
hydroxy acids, and carboxylic acids), N-alkylamino ethers 
(made from aromatic hydroxy acids, amino alcohols and 
15 aldehydes) 1, 4 -piperazines , and 1, 4-piperazine-6-ones . 

DeWitt, et al., Proc . Nat. Acad. Sci . (USA), 90:6909-13 
(1993) describe the simultaneous but separate, synthesis of 
40 discrete hydantoins and 40 discrete benzodiazepines. 
They carry out their synthesis on a solid support (inside a 
20 gas dispersion tube), in an array format, as opposed to 

other conventional simultaneous synthesis techniques (e.g., 
in a well, or on a pin) . The hydantoins were synthesized by 
first simultaneously deprotecting and then treating each of 
five amino acid resins with each of eight isocyanates . The 
25 benzodiazepines were synthesized by treating each of five 
deprotected amino acid resins with each of eight 2 -amino 
benzophenone imines . 

Chen, et al . , J. Am. Chem. Soc . , 116:2661-62 (1994) 
described the preparation of a pilot (9 member) 
3 0 combinatorial library of formate esters. A polymer bead- 
bound aldehyde preparation was "split" into three aliquots, 
each reacted with one of three different ylide reagents. 
The reaction products were combined, and then divided into 
three new aliquots, each of which was reacted with a 
35 different Michael donor. Compound identity was found to be 
determinable on a single bead basis by gas 
chromatography /mass spectroscopy analysis. 

Holmes, USP 5,549,974 (1996) sets forth methodologies 
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for the combinatorial synthesis of libraries of 
thiazolidinones and metathiazanones . These libraries are 
made by combination of amines, carbonyl compounds, and 
thiols under cyclization conditions. 

Ellman, USP 5,545,568 (1996) describes combinatorial 
synthesis of benzodiazepines, prostaglandins, beta-turn 
mimetics, and glycerol -based compounds. See also Ellman, 
USP 5,288,514. 

Summerton, USP 5,506,33 7 (1996) discloses methods of 
preparing a combinatorial library formed predominantly of 
morpholino subunit structures. 

Heterocylic combinatorial libraries are reviewed 
generally in Nefzi, et al. ; , Chem. Rev., 97:449-472 (1997). 

For pharmacological classes, see, e.g., Goth, Medical 
Pharmacology: Principles and Concepts (C.V. Mosby Co . : 8th 
ed. 197 6) ,- Korolkovas and Burckhalter, Essentials of 
Medicinal Chemistry (John Wiley & Sons, Inc. : 1976) . For 
synthetic methods, see, e.g., Warren, Organic Synthesis: The 
Disconnection Approach (John Wiley & Sons, Ltd. : 1982) ; 
Fuson, Reactions of Organic Compounds (John Wiley & Sons: 
1966) ; Payne and Payne, How to do an Organic Synthesis 
(Allyn and Bacon, Inc. : 1969) ; Greene, Protective Groups in 
Organic Synthesis (Wiley-Interscience) . For selection of 
substituents, see e.g., Hansch and Leo, Substituent 
Constants for Correlation Analysis in Chemistry and Biology 
(John Wiley & Sons: 1979) . 

The library is preferably synthesized so that the 
individual members remain identifiable so that, if a member 
is shown to be active, it is not necessary to analyze it. 
Several methods of identification have been' proposed, 
including : 

(1) encoding, i.e., the attachment to each member of 
an identifier moiety which is more readily 
identified than the member proper. This has the 
disadvantage that the tag may itself influence the 
activity of the conjugate. 

(2) spatial addressing, e.g., each member is 
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synthesized only at a particular coordinate on or 
in a matrix, or in a particular chamber. This 
might be, for example, the location of a 
particular pin, or a particular well on a 
microtiter plate, or inside a "tea bag". 
The present invention is not limited to any particular form 
of identification. 

However, it is possible to simply characterize those 
members of the library which are found to be active, based 
on the characteristic spectroscopic indicia of the various 
building blocks. 

Solid phase synthesis permits greater control over 
which derivatives are formed. However, the solid phase 
could interfere with activity. To overcome this problem, 
some or all of the molecules of each member could be 
liberated, after synthesis but before screening. 

Examples of candidate simple libraries which might be 
evaluated include derivatives of the following: 
Cyclic Compounds Containing One Hetero Atom 
Heteronitrogen 
pyrroles 

pentasubstituted pyrroles 
pyrrol idines 
pyrrol ines 
prolines 
indoles 

beta -carbol ines 
pyridines 

d i hydr opyr i d i ne s 

1 , 4 -dihydropyridines 

pyrido [2 , 3 -d] pyrimidines 

tetrahydro-3H-imidazo [4 , 5-c] pyridines 
Isoquinolines 

tetrahydroisoquinolines 
quinolones 
beta-lactams 

azabicyclo [4 . 3 . 0] nonen-8-one amino acid 
Heterooxygen 
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furans 

tetrahydrof urans 

2 , 5-disubstituted tetrahydrof urans 

pyrans 

hydroxypyranones 
t e t r ahydroxypyr anone s 
gamma -butyrolactones 
Heterosulfur - 
sulfolenes 

Cyclic Compounds with Two or More Hetero atoms 
Multiple heteronitrogens 
imidazoles 
pyrazoles 
piperazines 

diketopiperazines 
arylpiperazines 
benzylpiperazines 
benzodiazepines 

1 , 4-benzodiazepine-2 , 5-diones 
hydantoins 

5 - a 1 koxyhydant o i ns 
d i hy dr opy r i mi d i ne s 

1 , 3-disubstituted-5 , 6-dihydopyrimidine-2 , 4- 

diones 
cyclic ureas 
cyclic thioureas 
quinazolines 

chiral 3 -substituted-quinazoline-2 , 4- 

es 

triazoles 

1, 2, 3-triazoles 
purines 

Heteronitrogen and Heterooxygen 

dikelomorpholines 

isoxazoles 

isoxazolines 
Heteronitrogen and Heterosulfur 
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thiazolidines 

N-axylthiazolidines 
dihydrothiazoles 

2- methylene-2 , 3 -dihydrothiazates 
5 2 -aminothiazoles 

thiophenes 

3 - amino thiophenes 
4 -thiazolidinones 

4 -melathiazanones 
10 benzisothiazolones 

For details on synthesis of libraries, see Nefzi, et 
al . , Chem. Rev., 97:449-72 (1997), and references cited 
therein. 

15 Pharmaceutical Methods and Preparations 

The preferred animal subject of the present invention 
is a mammal. By the term "mammal" is meant an individual 
belonging to the class Mammalia. The invention is 
particularly useful in the treatment of human subjects, 

20 although it is intended for veterinary and nutritional uses 
as well. Preferred nonhuman subjects are of the orders 
Primata (e.g., apes and monkeys), Artiodactyla or 
Perissodactyla (e.g., cows, pigs, sheep, horses, goats), 
Carnivora (e.g., cats, dogs), Rodenta (e.g., rats, mice, 

25 guinea pigs, hamsters), Lagomorpha (e.g., rabbits) or other 
pet, farm or laboratory mammals. 

The term "protection", as used herein, is intended to 
include "prevention," "suppression" and "treatment." 
"Prevention", strictly speaking, involves administration of 

3 0 the pharmaceutical prior to the induction of the disease (or 
other adverse clinical condition) . "Suppression" involves 
administration of the composition prior to the clinical 
appearance of the disease. "Treatment" involves 
administration of the protective composition after the 

35 appearance of the disease. 

It will be understood that in human and veterinary 
medicine, it is not always possible to distinguish between 
"preventing" and "suppressing" since the ultimate inductive 
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event or events may be unknown, latent, or the patient is 
not ascertained until well after the occurrence of the event 
or events. Therefore, unless qualified, the term 
"prevention" will be understood to refer to both prevention 
5 in the strict sense, and to suppression. 

The preventative or prophylactic use of a 
pharmaceutical involves identifying subjects who are at 
higher risk than the general population of contracting the 
disease, and administering the pharmaceutical to them in 

10 advance of the clinical appearance of the disease. The 
effectiveness of such use is measured by comparing the 
subsequent incidence or severity of the disease, or of 
particular symptoms of the disease, in the treated subjects 
against that in untreated subjects of the same high risk 

1 5 group . 

While high risk factors vary from disease to disease, 
in general, these include (1) prior occurrence of the 
disease in one or more members of the same family, or, in 
the case of a contagious disease, in individuals with whom 
2 0 the subject has come into potentially contagious contact at 
a time when the earlier victim was likely to be contagious, 
(2) a prior occurrence of the disease in the subject, (3) 
prior occurrence of a related disease, or a condition known 
to increase the likelihood of the disease, in the subject ; 

2 5 (4) appearance of a suspicious level of a marker of the 

disease, or a related disease or condition; (5) a subject 
who is immunologically compromised, e.g., by radiation 
treatment, HIV infection, drug use,, etc., or (6) membership 
in a particular group (e.g., a particular age, sex, race, 

3 0 ethnic group, etc.) which has been epidemiologically 

associated with that disease. 

A prophylaxis or treatment may be curative, that is, 
directed at the underlying cause of a disease, or 
ameliorative, that is, directed at the symptoms of the 
35 disease, especially those which reduce the quality of life. 

It should also be understood that to be useful, the 
protection provided need not be absolute, provided that it 
is sufficient to carry clinical value. An agent which 
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provides protection to a lesser degree than do competitive 
agents may still be of value if the other agents are 
ineffective for a particular individual, if it can be used 
in combination with other agents to enhance the level of 
5 protection, or if it is safer than competitive agents. It is 
desirable that there be a statistically significant (p=0 . 05 
or less) improvement in the treated subject relative to an 
appropriate untreated control, and it is. desirable that this 
improvement be at least 10%, more preferably at least 25%, 

10 still more preferably at least 50%, even more preferably at 
least 100%, in some indicia of the incidence or severity of 
the disease or of at least one symptom of the disease. 

At least one of the . drugs of the present invention may 
be administered, by any means that achieve their intended 

15 purpose, to protect a subject against a disease or other 
adverse condition. The form of administration may be 
systemic or topical. For example, administration of such a 
composition may be by various parenteral routes such as 
subcutaneous, intravenous, intradermal, intramuscular, 

2 0 intraperitoneal, intranasal, transdermal, or buccal routes. 

Alternatively, or concurrently, administration may be by the 
oral route. Parenteral administration can be by bolus 
injection or by gradual perfusion over time. 

A typical regimen comprises administration of an 
25 effective amount of the drug, administered over a period 
ranging from a single dose, to dosing over a period of 
hours, days, weeks, months, or years. 

It is understood that the suitable dosage of a drug of 
the present invention will be dependent upon the age, sex, 

3 0 health, and weight of the recipient, kind of concurrent 

treatment, if any, frequency of treatment, and the nature of 
the effect desired. However, the most preferred dosage can 
be tailored to the individual subject, as is understood and 
determinable by one of skill in the art, without undue 
35 experimentation. This will typically involve adjustment of 
a standard dose, e.g., reduction of the dose if the patient 
has a low body weight. 

Prior to use in humans, a drug will first be evaluated 
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for safety and efficacy in laboratory animals. In human 
clinical studies, one would begin with a dose expected to be 
safe in humans, based on the preclinical data for the drug 
in question, and on customary doses for analogous drugs (if 
any) . If this dose is effective, the dosage may be 
decreased, to determine the minimum effective dose, if 
desired. If this dose is ineffective, it will be cautiously 
increased, with the patients monitored for signs of side 
effects. See, e.g., Berkow et al , eds . , The Merck Manual, 
15th edition, Merck and Co., Rahway, N.J., 1987; Goodman et 
al . , eds., Goodman and Gilman's The Pharmacological Basis of 
Therapeutics, 8th edition, Pergamon Press, Inc., Elmsford, 
N.Y., (1990); Avery's Drug Treatment: Principles and 
Practice of Clinical Pharmacology and Therapeutics , 3rd 
edition, ADIS Press, LTD., Williams and Wilkins, Baltimore, 
MD. (1987), Ebadi, Pharmacology, Little, Brown and Co., 
Boston, (1985) , which references and references cited 
therein, are entirely incorporated herein by reference. 

The total dose required for each treatment may be 
administered by multiple doses or in a single dose. The 
protein may be administered alone or in conjunction with 
other therapeutics directed to the disease or directed to 
other symptoms thereof . 

rypica] pharmac •■v.t; i c .1 ioses oj adul t human; , u ■-_ r> 
the - range of ] nc - lOg pe Im\ more . \ t e i IjTig ; i^p„r 

The appropriate dosage form will depend on the disease, 
the pharmaceutical, and the mode of administration; 
possibilities include tablets, capsules, lozenges, dental 
pastes, suppositories, inhalants, solutions, ointments and 
parenteral depots. See, e.g., Berker, supra, Goodman, 
supra, Avery, supra and Ebadi, supra, which are entirely 
incorporated" herein by reference, including all references 
cited therein. 

In the case of peptide drugs, the drug may be 
administered in the form of an expression vector comprising 
a nucleic acid encoding the peptide; such a vector, after 
incorporation into the genetic complement of a cell of the 
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patient, directs synthesis of the peptide. Suitable vectors 
include genetically engineered poxviruses (vaccinia) , 
adenovimses, adeno-associated viruses, herpesviruses and 
lentiviruses which are or have been rendered nonpathogenic. 
5 In addition to at least one drug as described herein, a 

pharmaceutical composition may contain suitable 
pharmaceut ically acceptable carriers, such as excipients, 
carriers and/or auxiliaries which facilitate processing of 
the active compounds into preparations which can be used 
10 pharmaceutical ly. See, e.g., Berker, supra., Goodman, supra, 
Avery, supra and Ebadi , supra, which are entirely 
incorporated herein by reference, included all references 
cited therein. 

15 Assay Compositions and Methods 

Target Organism 

The invention contemplates that it may be appropriate 
to ascertain or to mediate the biological activity of a 
substance of this invention in a target organism. 
2 0 The target organism may be a plant, animal, or 

microorganism. 

In the case of a plant, it may be an economic plant, in 
which case the drug may be intended to increase the disease, 
weather or pest resistance, alter the growth 

2 5 characteristics, or otherwise improve the useful 

characteristics or mute undesirable characteristics of the 
plant. Or it may be a weed, in which case the drug may be 
intended to kill or otherwise inhibit the growth of the 
plant, or to alter its characteristics to convert it from a 

3 0 . weed to an economic plant. The plant may be a tree, shrub, 

crop, grass, etc. The plant may be an algae (which are in 
some cases also microorganisms), or a vascular plant, 
especially gymnosperms (particularly conifers) and 
angiosperms . Angiosperms may be monocots or dicots. The 
35 plants of greatest, interest are rice, wheat, corn, alfalfa, 
soybeans, potatoes, peanuts, tomatoes, melons, apples, 
pears, plums, pineapples, fir, spruce, pine, cedar, and oak. 
If the target organism is a microorganism, it may be 
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electrodes in the chip were used to create electrokinet ic 
forces capable of driving molecules through these micro- 
channels to. perform electrophoret ic separations. Ribosomal 
peaks were measured by fluorescence signal and displayed in 
5 an elect ropherogram. A successful total RNA sample featured 
2 distinct ribosomal peaks (18S and 28S rRNA) . 

Biotinylated cRNA Hybridization Target. 

Total RNA was prepared for use as a hybridization 
10 target as described in the manufacturer' s instructions for 
CodeLink Expression Bioarrays (TM) (Amersham Biosciences) . 
The CodeLink Expression Bioarrays utilize nucleic acid 
hybridization of a biotin-labeled complementary RNA(cRNA) 
target with DNA oligonucleotide probes attached to a gel 
15 matrix. 

The biotin-labeled cRNA target is prepared by a linear 
amplification method. Poly (A) + RNA (within the total RNA 
population) is primed for reverse transcription by a DNA 
.oligonucleotide containing a T7 RNA polymerase promoter 5' 
2 0 to a (dT) 24 sequence. After second-strand cDNA synthesis, 

the cDNA serves as the template in an in vitro transcription 
(IVT) reaction to produce the target cRNA. The IVT is 
performed in the presence of biotinylated nucleotides to 
label the target cRNA. This procedure results in a 50-2 00 
25 fold linear amplification of the input poly (A) + RNA. 

Hybridization Probes . 

The oligonucleotide probes were provided by the 
Codelink Uniset Mouse I Bioarray (Amersham, product code 

30 300013). Amine-terminated oligonucleotide probes are 

attached to a three-dimensional polyacrylamide gel matrix. 
There are 10,000 oligonucleotide probes, each specific to a 
well-characterized mouse gene. Each mouse gene is 
representative of a unique gene cluster from the fourth 

35 quarter 2001 Genbank Unigene build. There are also 500 
control probes . 

The sequences of the probes is proprietary to Amersham. 
However, for each probe, Amersham identifies the 
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corresponding mouse gene by NCBI accession number, OGS, 
LocusLink, Unigene Cluster ID, and description (name) . 
This information should be available from Amersham. In the 
case of the differentially expressed probes, this 
5 information is duplicated in master table 1. For the 
complete list, see 

http : //www4 . amershambiosciences . com/aptrix/upp010 77 .nsf/Cont 
ent/codelink_literature 

10 Under "Gene Lists" , select "Uniset Human I" , and a gene 
list, in Excel format, can be downloaded. 

Hybridization 

Using the cRNA target, the hbridization reaction 
15 mixture is prepared and loaded until array chambers for 
bioarray processing as set forth in the manufacturer's 
instructions for CodeLink Gene Expression BioarraysTM 
(Amerhsam Biosciences) . Each sample is hybridized to an 
individual microarray. Hybridization is at 3 7°C. The 
2 0 hybridization buffer is prepared as set forth in the 

Motorola instructions. Hybridization to the microarray is 
detected with an avidinated fluorescent reagent, 
Streptavidin-Alexa Fluor ® 647 (Amersham) . 

2 5 Mouse Gene Expression Analysis 

Processed arrays were scanned using a GenePix 4000B 
Microarray Scanner (Axon Instruments, Inc.); array images 
were acquired using the Amersham CodeLink™ Analysis Software 
(Release 2.2). The Amersham CodeLink™ Analysis Software 

3 0 gives an integrated optical density (IOD) value for every 

spot; a unique background value for that spot is subtracted, 
resulting in "raw" data points. Individual chips are then 
normalized by the Amersham Codelink™ software according to 
the median raw intensity for all 10,000 genes. A negative 
35 control threshold is also calculated according to the 

control probes. A significant difference in expression 
between samples was defined as a minimum of 2 -fold change in 
expression values. Genes with expression values below the 
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negative control threshold were eliminated from the analysis 
and then the expression data was analyzed to identify genes 
whose expression levels changed significantly with respect 
to: 

5 

Normal mice compared to hyperinsulinemic mice at 2 , 4, 
8 and 16 weeks, and 6 months, on normal vs. high-fat 
diet. 

10 Normal mice compared to hyperinsulinemic/hyperglycemic 

mice at 2, 4, 8 and 16 weeks, and 6 months, on normal 
vs. high-fat diet. 

Hyperinsulinemic compared to 
15 hyperinsulinemic/hyperglycemic mice at 2 , 4, 8 and 16 

weeks, and 6 months, on high- fat diets. 

Database Searches Nucleotide sequences and predicted amino 
acid sequences were compared to public domain databases 

2 0 using the Blast 2.0 program (National Center for 

Biotechnology Information, National Institutes of Health) . 
Nucleotide sequences were displayed using ABI prism Edit 
View 1.0.1 (PE Applied Biosystems, Foster City, CA) . 

Nucleotide database searches were conducted with the 
25 then current version of BLASTN 2.0.12, see Altschul, et al . , 
"Gapped BLAST and PSI -BLAST: a new generation of protein 
database search programs", Nucleic Acids Res., 25:3389-3402 
(1997) . Searches employed the default parameters, unless 
otherwise stated. 

3 0 For blastN searches, the default was the blastN matrix 

(1,-3), with gap penalties of 5 for existence and 2 for 
extension. 

Protein database searches were conducted with the then- 
current version of BLAST X, see- Altschul et al . (1997), 
3 5 supra . Searches employed the default parameters, unless 

otherwise stated. The scoring matrix was BLOSUM62 , with gap 
costs of 11 for existence and 1 for extension. The standard 
low complexity filter was used. 
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"ref" indicates that NCBI ' s RefSeq is the source 
database. The identifier that follows is a RefSeq accession 
number, not a GenBank accession number. "RefSeq sequences 
are derived from GenBank and provide non- redundant curat ed 
5 data representing our current knowledge of known genes. Some 
records include additional sequence information that was 
never submitted to an archival database but is available in 
the literature. A small number of sequences are provided 
through collaboration; the underlying primary sequence data 

10 is available in GenBank, but may not be available in any one 
GenBank record. RefSeq sequences are not submitted primary 
sequences. RefSeq records are owned by NCBI and therefore 
can be updated as needed to maintain current annotation or 
to incorporate additional sequence information." See also 

15 ht to : / /www . ncbi . nlm . nih . gov/LocusLink/refsea . html 

It will be appreciated by those in the art that the 
exact results of a database search will change from day to. 
day, as new sequences are added. Also, if you query with a 
longer version of the original sequence, the results will 

20 change. The results given here were obtained at one time 
and no guarantee is made that the exact same hits would be 
obtained in a search on the filing date. However, if an 
alignment between a particular query sequence and a 
particular database sequence is discussed, that alignment 

2 5 should not change (if the parameters and sequences remain 
unchanged) . 



Northern Analysis . 

30 Northern analysis may be used to confirm the results. 

Favorable and unfavorable genes, identified as described 
above, or fragments thereof, will be used as probes in 
Northern hybridization analyses to confirm their 
differential expression. Total RNA isolated from Control, 

35 Hyper insulinemic and Type- I I Diabetic mice will be resolved 
by agarose gel electrophoresis through a 1% agarose, 1 % 
formaldehyde denaturing gel, transferred to positively 
charged nylon membrane, and hybridized to a probe labeled 
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with [32P] dCTP that was generated from the aforementioned 
gene or fragment using the Random Primed DNA Labeling Kit 
(Roche, Palo Alto, CA) . 

5 Real-Time RNA Analysis. 

Real-time RNA analysis may also be used for 
confirmation. For "real-time" RNA analysis, RNA will be 
converted to cDNA and then probed with gene-specific primers 
made for each clone. "Real-time" incorporation of 

10 fluorescent dye will be measured to determine the amount of 
specific transcript present in each sample. Sample 
differences (control vs. hyperinsulinemic, hyperinsulinemic 
vs. diabetic, or control vs. diabetic) of 2-fold or greater 
(in either direction) will be considered differentially 

15 expressed. Confirmation using several independent animals 
is desirable. 

In situ Hybridization 

Another form of confirmation may be provided by 

20 nonisotopic in situ hybridizations (NISH) on selected human 
(obtained by Tissue Informatics) and mouse tissues using 
cRNA probes generated from mouse genes found to be up- or 
down- regulated during the disease progression. Nonisotopic 
in situ hybridizations may also be performed on mouse 

25 tissues using cRNA probes generated from all "novel" cDNA's 
identified through PCR subtractive hybridizations. These 
cRNA' s will hybridize to their corresponding messenger RNA' s 
present in cells and will provide information regarding the 
particular cell types within a tissue that is expressing the 

3 0 particular gene as well as the relative level of gene 

expression. The cRNA probes may be generated by in vitro 
transcription of template cDNA by Sp6 or .T7 RNA polymerase 
in the presence of digoxigenin-ll-UTP (Roche Molecular 
Biochemicals , Mannheim, Germany; Pardue, M.L. 1985. In: In 

35 situ hybridization, Nucleic acid hybridization, a practical 
approach: IRL Press, Oxford, 179-202) . 

Transgenic Animals. 



WO 2004/092416 



PCT/US2004/0 10191 



83 

Transgenic expression may be used to confirm the results. 
In one embodiment , a mouse is engineered to overexpress the 
favorable or unfavorable mouse gene in question. In another 
embodiment, a mouse is engineered to express the 
corresponding favorable or unfavorable human gene. In a 
third embodiment, a nonhuman animal other than a mouse, such 
as a rat, rabbit, goat, sheep or pig, is engineered to 
express the favorable or unfavorable mouse or human gene. 

Hyperquantitative Tissue Analysis 

In addition to gene expression analysis the liver sections 
can also be analyzed using Tissuelnf ormatics , Inc's 
TissueAnalytics™ software. A single representative section 
may be cut from each liver block, placed on a slide, and 
stained with H&E . Digital images of each slide may be 
acquired using an research microscope and digital camera 
(Olympus E600 microscope and Sony DKC-ST5) . These images 
were acquired at 2 Ox magnification with a resolution of 0.64 
mm/pixel. A hyperquantitative analysis may be performed on 
the resulting images: First a digital image analysis can 
identify and annotate structural objects in a tissue using 
machine vision. These objects, that are constituents of the 
tissue, can be annotated because they are visually 
identifiable and have a biological meaning like hepatocytes, 
sinusoids, vacuoles. Subsequently a quantification of these 
structures regarding their geometric properties like area or 
stain intensities and their relationship to the field of 
view or per unit area in terms of a % coverage may be 
performed. Features or parameters for hyper-quantification 
are specific for each tissue, and may also include relations 
between features, measures of overall heterogeneity, 
including orientation, relative locations, and textures. 

Correlation Analysis 

Mathematical statistics provides a rich set of additional 
tools to analyze time resolved data sets of hyper- 
quantitative and gene expression profiles for similarities, 
including rank correlation, the calculation of regression 
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and correlations coefficients, and clustering. Continuous 
functions may also be fitted through the data points of 
individual gene and tissue feature data. Relation between 
gene expression and hyper -quantitative tissue data may be 
5 linear or non- linear, in synchronous or asynchronous 
arrangements . 

A Spearman rank correlation analysis using was done on the 
2 classes of measurements (Genes and Tissues Features) to 
help identify other significant genes. A small number of 
10 genes that did not meet the 2 -Fold difference for 

significance were added to the list of genes based on their 
correlation with tissue features. i 



15 
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Citation of documents herein is not intended as an 
admission that any of the documents cited herein is 
pertinent prior art, or an admission that the cited 
documents is considered material to the patentability of: any 
5 of the claims of the present application . All statements as 
to the date or representation as to the contents of these 
documents is based on the information available to the 
applicant and does not constitute any admission as to the 
correctness of the dates or contents of these documents . 

10 The appended claims are to be treated as a non-limiting- 

recitation of preferred embodiments. 

In addition to those set forth elsewhere, the following 
references are hereby incorporated by reference, in their 
most recent editions as of the time of filing of this 

15 application: Kay, Phage Display of Peptides and Proteins:. A 
Laboratory Manual; the John Wiley and Sons Current Protocols 
series, including Ausubel , Current Protocols in Molecular 
Biology; Coligan, Current Protocols in Protein Science ; 
Coligan, Current Protocols in Immunology; Current Protocols 

2 0 in Human Genetics ; Current Protocols in Cytometry; Current 
Protocols in Pharmacology; Current Protocols in 
Neuroscience ; Current Protocols in Cell Biology; Current 
Protocols in Toxicology; Current Protocols in Field 
Analytical Chemistry ; Current Protocols in Nucleic Acid 

2 5 Chemistry; and Current Protocols in Human Genetics ; and 

the following Cold Spring Harbor Laboratory publications : 
Sambrook, Molecular Cloning: A Laboratory Manual; Harlow, 
Antibodies : A Laboratory Manual; Manipulating the Mouse 
Embryo: A Laboratory Manual; Methods in Yeast Genetics : A 

3 0 Cold Spring Harbor Laboratory Course Manual; Drosophila 

Protocols ; Imaging Neurons : A Laboratory Manual; Early 
Development of Xenopus laevis: A Laboratory Manual; Using 
Antibodies : A Laboratory Manual; At the Bench: A Laboratory 
Navigator; Cells: A Laboratory Manual; Methods in Yeast 
3 5 Genetics : A Laboratory Course Manual; Discovering Neurons: 
The Experimental Basis of Neuroscience ; Genome Analysis : A 
Laboratory Manual Series ; Laboratory DNA Science; 
Strategies for Protein Purification and Characterization: A 
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Laboratory Course Manual; Genetic Analysis of Pathogenic 
Bacteria: A Laboratory Manual; PCR Primer: A Laboratory 
Manual; Methods in Plant Molecular Biology: A Laboratory 
Course Manual ; Manipulating the Mouse Embryo: A Laboratory 
5 Manual; Molecular Probes of the Nervous System; Experiments 
with Fission Yeast: A Laboratory Course Manual; A Short 
Course in Bacterial Genetics : A Laboratory Manual and 
Handbook for Escherichia coli and Related Bacteria; DNA 
Science : A First Course in Recombinant DNA Technology ; 

10 Methods in Yeast Genetics : A Laboratory Course Manual; 

Molecular Biology of Plants: A Laboratory Course Manual. 

All references cited herein, including journal articles or 
abstracts, published, corresponding , prior or otherwise 
related U.S. or foreign patent applications, issued U.S. or 

15 foreign patents, or any other references, are entirely 
incorporated by reference herein, including all data, 
tables, figures, and text presented in the cited references. 
Additionally, the entire contents of the references cited 
within the references cited herein are also entirely 

2 0 incorporated by reference . 

Reference to known method steps, conventional methods 
steps, known methods or conventional methods is not in any 
way an admission that any aspect, description or embodiment 
of the present invention is disclosed, taught or suggested 

2 5 in the relevant art. 

The foregoing description of the specific embodiments will 
so fully reveal the general nature of the invention that 
others can, by applying knowledge within the skill of the 
art ( including the contents of the references cited herein), 

3 0 readily modify and/or adapt for various applications such 

specific embodiments, without undue experimentation, without 
departing from the general concept of the' present invention. 
Therefore, such adaptations and modifications are intended 
to be within the meaning and range of equivalents of the 
35 disclosed embodiments, based on the teaching and guidance 
presented herein. It is to be understood that the 
phraseology or terminology herein is for the purpose of 
description and not of limitation, such that the terminology 
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or phraseology of the present specification is to be 
interpreted by the skilled artisan in light of the teachings 
and guidance presented herein, in combination with the 
knowledge of one of ordinary skill in the art. 

Any description of a class or range as being useful or 
preferred in the practice of the invention shall be deemed a 
description of any subclass (e.g., a disclosed class with 
one or more disclosed members omitted) or subrange contained 
therein, as well as a separate description of each 
individual member or value in said class or range. 

The description of preferred embodiments individual ly 
shall be deemed a description of any possible combination of 
such preferred embodiments, except for combinations which 
are impossible (e.g, mutually exclusive choices for an 
element of the invention) or which are expressly excluded, by 
this specification. 

If an embodiment of this invention is disclosed in the 
prior art, the description of the invention shall be deemed 
to include the invention as herein disclosed with such 
embodiment excised. 
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Introduction to Master Tables 

The master tables reflect applicants' analysis of the gene 
chip data. 

5 

For each probe corresponding to a differentially expressed 
mouse gene, Master Table 1 identifies 

Col. 1: The mouse gene (upper) and mouse protein (lower) 
10 database accession #s . 

Col. 2: The corresponding mouse Unigene Cluster, as of the 
4 th Quarter 2 0 01 build. 

15 Col. 3: The behavior (differential expression) observed for 
the mouse gene. This column identifies the gene as 
favorable (F) or unfavorable (U) on the basis of its 
differential behavior. There are three possible 
comparisons, HI-D, C-HI, and C-D, where C=control (normal) , 

2 0 HI=hypex-insulinemic , and D=diabetic . 

If the level of the gene in the former state is at least 
two-fold that in the latter state, it is considered 
unfavorable. If the level of the gene in the former, state 
is not more than half (i.e., not more than negative two 
25 fold) that in the latter state, it is considered favorable. 

Col. 4: A related human protein, identified by its database 
accession number. Usually, several such proteins are 
identified relative to each mouse gene. These proteins have 

3 0 been identified by BLAST searches, as explained in cols. 6- 

8 . 

Col. 5: The name of the related human protein. 

35 Col. 6: The score (in bits) for the alignment performed by 
the BLAST program. 
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Col. 7: The E-value for the alignment performed by the BLAST 
program. It is worth noting that Unigene considers a Blastx 
E Value of less than le-6 to be a "match" to the reference 
sequence of a cluster. 

5 

Col. 8: The BLAST search strategy used. MG indicates that 
the mouse gene was used as the query sequence in a BlastX 
search. MP means that the mouse protein was used as the 
query sequence in a BlastP search.. HGP means that first the 
10 mouse gene was used in a BlastN search for a human gene, and 
then the human gene was used in a BLASTX search for the 
human protein. 

15 

Master Table 1 is divided into three subtables on the basis 
of the Behavior" in col. 3. If a gene has at least one 
favorable behavior, and no unfavorable ones, it is put into 
Subtable 1A. In the opposite case, it is put into Subtable 
20 IB. If its behavior is mixed, i.e., at least one favorable 
and at least one unfavorable, it is put into Subtable 1C. 

Master Table 2 has just three columns. 

2 5 Col. 1: Mouse gene. 

Col. 2: behavior. Same as col. 3 in Master table 1. 

Col. 3: Human protein classes. Based on the related human 
30 proteins defined in Master Table 1, Master Table 2 

generalizes, if possible as to classes of human proteins 
which are expected to have similar behavior. For a given 
mouse gene, several human protein classes may be listed 
because of the diversity of the human proteins found to be 
35 related. In some cases, the stated human protein classes 

may be hierarchial , e.g., one may be a subset of another. In 
other cases, the stated classes may be non- over lapping but 
related. And in yet other cases, the stated classes may be 
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algae, bacteria, fungi, or a virus (although the . biological 
activity of a virus must be determined in a virus-infected 
cell) . The microorganism may be human or other animal or 
plant pathogen, or it may be nonpathogenic. It may be a 
5 soil or water organism, or one which normally lives inside 
other living things.' 

If the target organism is an animal, it may be a 
vertebrate or a nonvertebrate animal . Nonvertebrate animals 
are chiefly of interest when they act as pathogens or 

10 parasites, and the drugs are intended to act as biocidic or 
biostatic agents. Nonvertebrate animals of interest include 
worms, mollusks, and arthropods. 

The target organism may also be a vertebrate animal, 
i.e., a mammal, bird, reptile, fish or amphibian. Among 

15 mammals, the target animal preferably belongs to the order 
Primata (humans, apes and monkeys), Artiodactyla (e.g., 
cows, pigs, sheep, goats, horses), Rodenta (e.g., mice, 
rats) Lagomorpha (e.g., rabbits, hares), or Carnivora (e.g., 
cats, dogs) . Among birds, the target animals are preferably 

20 of the orders Anseriformes (e.g., ducks, geese, swans) or 
Galliformes (e.g., quails, grouse, pheasants, turkeys and 
chickens) . Among fish, the target animal is preferably of 
the order Clupeiformes (e.g., sardines, shad, anchovies, 
whitef ish, salmon) . 

25 

Target Tissues 

The term "target tissue" refers to any whole animal, 
physiological system, whole organ, part of organ, 
miscellaneous tissue, cell, or cell component (e.g., the 

30 cell membrane) of a target animal in which biological 
activity may be measured. 

Routinely in mammals one would choose to compare and 
contrast the biological impact on virtually any and all 
tissues which express the subject receptor protein. The 

35 main tissues to use are: brain, heart, lung, kidney, liver, 
pancreas, skin, intestines, adipose, stomach, skeletal 
muscle, adrenal glands, breast, prostate, vasculature, 
retina, cornea, thyroid gland, parathyroid glands, thymus, 
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bone marrow, bone, etc. 

Another classification would be by cell type: B cells, 
T cells, macrophages, neutrophils, eosinophils, mast cells, 
platelets, megakaryocytes, erythrocytes, bone marrow stomal 
5 cells, fibroblasts, neurons, astrocytes, neuroglia, 

microglia, epithelial cells (from any organ, e.g. skin, 
breast, prostate, lung, intestines etc), cardiac muscle 
cells, smooth muscle cells, striated muscle cells, 
osteoblasts, osteocytes, chondroblasts , chondrocytes, 
10 keratinocytes , melanocytes, etc. 

Of course, in the case of a unicellular organism, there 
is no distinction between the "target organism" and the 
"target tissue". 

15 Screening Assays 

Assays intended to determine the binding or the 
biological activity of a substance are called preliminary 
screening assays. 

Screening assays will typically be either in vitro 

20 (cell-free) assays (for binding to an immobilized receptor) 
or cell-based assays (for alterations in the phenotype of 
the cell) . They will not involve screening of whole 
multicellular organisms, or isolated organs. The comments 
on diagnostic biological assays apply mutatis mutandis to 

25 screening cell -based assays. 

In Vitro vs . In Vivo Assays 

The term in vivo is descriptive of an event, such as 
binding or enzymatic action, which occurs within a living 

30 organism. The organism in question may, however, be 

genetically modified. The term in vitro refers to an event 
which occurs outside a living organism. Parts of an 
organism (e.g., a membrane, or an isolated biochemical) are 
used, together with artificial substrates and/or conditions. 

3 5 For the purpose of the present invention, the term in vitro 
excludes events occurring inside or on an intact cell, 
whether of a unicellular or multicellular organism. 

In vivo assays include both cell -based assays, and 
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organismic assays. The cell -based assays include both assays 
on unicellular organisms, and assays on isolated cells or 
cell cultures derived from multicellular organisms. The 
cell cultures may be mixed, provided that they are not 
5 organized into tissues or organs. The term organismic assay 
refers to assays on whole multicellular organisms, and 
assays on isolated organs or tissues of such organisms. 

In vitro Diagnostic Methods . and Reagents 

10 

The in vitro assays of the present invention may be 
applied to any suitable analyte- containing sample, and may 
be qualitative or quantitative in nature. 

15 Sample 

The sample will normally be a biological fluid, such as 
blood, urine, lymph, semen, milk, or cerebrospinal fluid, or 
a fraction or derivative thereof, or a biological tissue, in 
the form of, e.g., a tissue section or homogenate . However, 

2 0 the sample conceivably could be (or derived from) a food or 

beverage, a pharmaceutical or diagnostic composition, soil, 
or surface or ground water. If a biological fluid or 
tissue, it may be taken from a human or other mammal, 
vertebrate or animal, or from a plant. The preferred sample 
25 is blood, or a fraction or derivative thereof. 

Binding and Reaction Assays 

The assay may be a binding assay, in which one step 
involves the binding of a diagnostic reagent to the analyte, 

3 0 or a reaction assay, which involves the reaction of a 

reagent with the analyte. The reagents used in a binding 
assay may. be classified as to the nature of their 
interaction with analyte: (1) analyte analogues, or (2) 
analyte binding molecules (ABM) . They may be labeled or 
35 insolubilized . 

In a reaction assay, the assay may look for a direct 
reaction between the analyte and a reagent which is reactive 
with the analyte, or if the analyte is an enzyme or enzyme 
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inhibitor, for a reaction catalyzed or inhibited by the 
analyte. The reagent may be a reactant, a catalyst, or an 
inhibitor for the reaction'. 

An assay may involve a cascade of steps in which the 
5 product of one step acts as the target for the next step. 
These steps may be binding steps, reaction steps, or a 
combination thereof. 

Signal Producing- System (SPS) 

10 

In order to detect the presence, or measure the amount, 
of an analyte, the assay must provide for a signal producing 
system (SPS) in which there is a detectable difference in 
the signal produced, depending on whether the analyte is 

15 present or absent (or, in a quantitative assay, on the 

amount of the analyte) . The detectable signal may be one 
which is visually detectable, or one detectable only with 
instruments. Possible signals include production of colored 
or luminescent products, alteration of the characteristics 

20 (including amplitude or polarization) of absorption or 

emission of radiation by an assay component or product, and 
precipitation or agglutination of a component or product. 
The term "signal" is intended to include the discontinuance 
of an existing signal, or a change in the rate of change of 

2 5 an observable parameter, rather than a change in its 

absolute value. The signal may be monitored manually or 
automatically. . 

In a reaction assay, the signal is often a product of 
the reaction. In a binding assay, it is normally provided 

30 by a label borne by a labeled reagent. 

Labels 

The component of the signal producing system which is 
most intimately associated with the diagnostic reagent is 
35 called the "label". A label may be, e.g., a radioisotope, a 
fluorophore, an enzyme, a co-enzyme, an enzyme substrate, an 
electron-dense compound, an agglutinable particle. 

The radioactive isotope can be detected by such means 
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as the use of a gamma counter or a scintillation counter or 
by autoradiography. Isotopes which are particularly useful 
for the pur-pose of ' the present invention include 3 H, 125 1 , 
131 I, 35 S, 14 C, 32 P and 33 P. 125 I is preferred for antibody 
5 labeling. 

The label may also be a fluorophore. When the 
f luorescently labeled reagent is exposed to light of the 
proper wave length, its presence can then be detected due to 
fluorescence. Among the most commonly used fluorescent 
10 labeling compounds are fluorescein isothiocyanate , 

rhodamine, phycoerythrin, phycocyanin, allophycocyanin, o- 
phthaldehyde and f luorescamine . 

Alternatively, fluorescence-emitting metals such as 
125 Eu, or others of the lanthanide series, may be 
15 incorporated into a diagnostic reagent using such metal 
chelating groups as diethylenetriaminepentaacetic acid 
(DTPA) of ethylenediamine-tetraacetic acid (EDTA) . 

The label may also be a chemiluminescent compound. The 
presence of the chemiluminescently labeled reagent is then 
2 0 determined by detecting the presence of luminescence that 
arises during the course of a chemical reaction. Examples 
of particularly useful chemiluminescent labeling compounds 
are luminol , isolumino, theromatic acridinium ester, 
imidazole, acridinium salt and oxalate ester. 

2 5 Likewise, a bioluminescent compound may be used for 

labeling. Bioluminescence is a type of chemi luminescence 
found in biological systems in which a catalytic protein 
increases the efficiency of the chemiluminescent reaction. 
The presence of a bioluminescent protein is determined by 

3 0 detecting the presence of luminescence. Important 

bioluminescent compounds for purposes of labeling are 
luciferin, lucif erase and aequorin. 

Enzyme labels, such as horseradish peroxidase and 
alkaline phosphatase, are preferred. When an enzyme label 
3 5 is used, the signal producing system must also include a 

substrate for the enzyme. If the enzymatic reaction product 
is not itself detectable, the SPS will include one or more 
additional reactants so that a detectable product appears. 
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An enzyme analyte may act as its own label if an enzyme 
inhibitor is used as a diagnostic reagent. 



Binding- Assay Formats 
5 Binding assays may be divided into two basic types, 

heterogeneous and homogeneous. In heterogeneous assays, the 
interaction between the affinity molecule and the analyte 
does not affect the label, hence, to determine the amount or 
presence of analyte, bound label must be separated from free 

10 label. In homogeneous assays, the interaction does affect 
the activity of the label, and therefore analyte levels can 
be deduced without the need for a separation step. 

In one embodiment, the ABM is insolubilized by coupling 
it to a macromolecular support, and analyte in the sample is 

15 allowed to compete with a known quantity of a labeled or 
specifically labelable analyte analogue. The "analyte 
analogue" is a molecule capable of competing with analyte 
for binding to the ABM, and the term is intended to include 
analyte itself. It may be labeled already, or it may be 

20 labeled subsequently by specifically binding the label to a 
moiety differentiating the analyte analogue from analyte. 
The solid and liquid phases are separated, and the labeled 
analyte analogue in one phase is quantified. The higher the 
level of analyte analogue in the solid phase, i.e., 

2 5 sticking to the ABM, the lower the level of analyte in the 

sample . 

In a "sandwich assay", both an insolubilized ABM, and a 
labeled ABM are employed. The analyte is captured by the 
insolubilized ABM and is tagged by the labeled ABM, forming 

3 0 a ternary complex. The reagents may be added to the sample 

in either order, or simultaneously. The ABMs may be the 
same or different. The amount of labeled ABM in the ternary 
complex is directly proportional to the amount of analyte in 
the sample . • • 

3 5 The two embodiments described above are both 

heterogeneous assays. However, homogeneous assays are 
conceivable. The key is that the label be affected by 
whether or not the complex is formed. 
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Conjugation Methods 

A label may be conjugated, directly or indirectly 
(e.g., through a labeled anti-ABM antibody), covalently 
(e.g., with SPDP) or noncovalently , to the ABM, to produce a 
5 diagnostic reagent. Similarly, the ABM may be conjugated to 
a solid phase support to form a solid phase ("capture") 
diagnostic reagent. 

Suitable supports include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, 
10 natural and modified celluloses, polyacrylamides , agaroses, 
and magnetite. The nature of the carrier can be either 
soluble to some extent or insoluble for the purposes of the 
present invention. 

The support material may have virtually any possible 
15 structural configuration so long as the coupled molecule is 
capable of binding to its target. Thus the support 
configuration may be spherical, as in a bead, or 
cylindrical, as in the inside surface of a test tube, or the 
external surface of a rod. Alternatively, the surface may 
20 be flat such as a sheet, test strip, etc. 

Biological Assays 

A biological assay measures or detects a biological 
response of a biological entity to a substance. 

2 5 The biological entity may be a whole organism, an 

isolated organ or tissue, freshly isolated cells, an 
immortalized cell line, or a subcellular component (such as 
a membrane; this term should not be construed as including 
an isolated receptor) . The entity may be, or may be derived 

3 0 from, an organism which occurs in nature, or which is 

modified in some way. Modifications may be genetic 
(including radiation and chemical mutants, and genetic 
engineering) or somatic (e.g., surgical, chemical, etc.). 
In the case of a multicellular entity, the modifications may 
35 affect some or all cells. The entity need not be the target 
organism, or a derivative thereof, if there is a reasonable 
correlation between bioassay activity in the assay entity 
and biological activity in the target organism. 
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The entity is placed in a particular environment, which 
may be more or less natural. For example, a culture medium 
may, but need not, contain serum or serum substitutes, and 
it may, but need not, include a support matrix of some kind, 
it may be still, or agitated. It may contain particular 
biological or chemical agents, or have particular physical 
parameters (e.g., temperature), that are intended to nourish 
or challenge the biological entity. 

There must also be a detectable biological marker for 
the response. At the cellular level, the most common 
markers are cell survival and proliferation, cell behavior 
(clustering, motility) , cell morphology (shape, color) , and 
biochemical activity (overall DNA synthesis, overall protein 
synthesis, and specific metabolic activities, such as 
utilization of particular nutrients, e.g., consumption of 
oxygen, production of C0 2 , production of organic acids, 
uptake or discharge of ions) . 

The direct signal produced by the biological marker may 
be transformed by a signal producing system into a different 
signal which is more observable, for example, a fluorescent 
or colorimetric signal. 

The entity, environment, marker and signal producing 
system are chosen to achieve a clinically acceptable level 
of sensitivity, specificity and accuracy. 

In some cases, the goal will be to identify substances 
which mediate the biological activity of a natural 
biological entity, and the assay is carried out directly 
with that entity. In other cases, the biological entity is 
used simply as a model of some more complex (or otherwise 
inconvenient to work with) biological entity. In that 
event, the model biological entity is used because activity 
in the model system is considered more predictive of 
activity 'in the ultimate natural biological entity than is 
simple binding activity in an in vitro system. The model 
entity is used instead of the ultimate entity because the 
former is more expensive or slower to work with, or because 
ethical considerations forbid working with the ultimate 
entity yet . 
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The model entity may be naturally occurring, if the 
model entity usefully models the ultimate entity under some 
conditions. Or it may be non-naturally occurring, with 
modifications that increase its resemblance to the ultimate 
5 entity. 

Transgenic animals, such as transgenic mice, rats, and 
rabbits, have been found useful as model systems. 

In cell-based model assays, where the biological 
activity is mediated by binding to a receptor (target 
10 protein) , the receptor may be functionally connected to a 
signal (biological marker) producing system, which may be 
endogenous or exogenous to the cell. 
There are a number of techniques of doing this . 

15 " Zero -Hybrid" Systems 

In these systems, the binding of a peptide to the 
target protein results in a screenable or selectable 
phenotypic change, without resort to fusing the target 
protein (or a ligand binding moiety thereof) to an 

2 0 endogenous protein. It may be that the target protein is 
endogenous to the host cell, or is substantially identical 
to an endogenous receptor so that it can take advantage of 
the latter' s native signal transduction pathway. Or 
sufficient elements of the signal transduction pathway 

2 5 normally associated with the target protein may be 

engineered into the cell so that the cell signals binding to 
the target protein. 

"One-Hybrid" Systems 

3 0 In these systems, a chimera receptor, a hybrid of the 

target protein and an endogenous receptor, is used. The 
chimeric receptor has the ligand binding characteristics of 
the target protein and the signal transduction 
characteristics of the endogenous receptor. Thus, the 
35 normal signal transduction pathway of the endogenous 
receptor is subverted. 

Preferably, the endogenous receptor is inactivated, or 
the conditions of the assay avoid activation of the 
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endogenous receptor, to improve the signal-to-noise ratio. 

See Fowlkes USP 5,789,184 for a yeast system. 

Another type of "one-hybrid" system combines a peptide: 
DNA-binding domain fusion with an unfused target receptor 
that possesses an activation domain. 



"Two-Hybrid" System 

In a preferred embodiment, the cell -based assay is a 
two hybrid system. This term implies that the ligand is 

10 incorporated into a first hybrid protein, and the receptor 
into a second hybrid protein. The first hybrid also 
comprises component A of a signal generating system, and the 
second hybrid comprises component B of that system/ 
Components A and B, by themselves, are insufficient to 

15 generate a signal. However, if the ligand binds the 

receptor, components A and Bare brought into sufficiently 
close proximity so that they can cooperate to generate a 
signal . 

Components A and B may naturally occur, or be 
20 substantially identical to moieties which naturally occur, 
as components of a single naturally occurring biomolecule, 
or they may naturally occur, or be substantially identical 
to moieties which naturally occur, as separate naturally 
occurring biomolecules which interact in nature. 

25 

Two-Hybrid System: Transcription Factor Type 

In a preferred "two-hybrid" embodiment, one member of a 
peptide ligand : receptor binding pair is expressed as a 
fusion to a DNA-binding domain (DBD) from a transcription 

3 0 factor (this fusion protein is called the "bait") , and the 
other is expressed as a fusion to a transactivation domain 
(TAD) (this fusion protein is called the "fish", the "prey", 
or the "catch"). The transactivation domain should be 
complementary to the DNA-binding domain, i.e., it should 

35 interact with the latter so as to activate transcription of 
a specially designed reporter gene that carries a binding 
site for the DNA-binding domain. Naturally, the two fusion 
proteins must likewise be complementary. 
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This complementarity may be achieved by use of the 
complementary and separable DNA-binding and transcriptional 
activator domains of a single transcriptional activator 
protein, or one may use complementary domains derived from 
5 different proteins. The domains may be identical to the 

native domains, or mutants thereof. The assay members may 
be fused directly to the DBD or TAD, or fused through an 
intermediated linker. 

The target DNA operator may be the native operator 
10 sequence, or a mutant operator. Mutations in the operator 
may be coordinated with mutations in the DBD and the TAD. 
An example of a suitable transcription activation system is 
one comprising the DNA-binding domain from the bacterial 
repressor LexA and the activation domain from the yeast 
15 transcription factor Gal4 , with the reporter gene operably 
linked to the LexA operator. 

It is not necessary to employ the intact target 
receptor; just the ligand-binding moiety is sufficient. 

The two fusion proteins may be expressed from the same 

2 0 or different vectors. Likewise, the activatable reporter 

gene may be expressed from the same vector as either fusion 
protein (or both proteins) , or from a third vector. 

Potential DNA-binding domains include Gal4, LexA, and 
mutant domains substantially identical to the above. 
25 Potential activation domains include E. coli B42 , Gal4 

activation domain II, and HSV VP16, and mutant domains 
substantially identical to the above. 

Potential operators include the native operators for 
the desired activation domain, and mutant domains 
30 substantially identical to the native operator. 

The fusion proteins may comprise nuclear localization 
signals . 

The assay system will include a signal producing 
system, too. The first element of this system is a reporter 

3 5 gene operably linked to an operator responsive to the DBD 

and TAD of choice. The expression of this reporter gene 
will result,- directly or indirectly, in a selectable or 
screenable phenotype (the signal) . The signal producing 
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system may include, besides the reporter gene, additional 
genetic or biochemical elements which cooperate in the 
production of the signal. Such an element could be, for 
example, a selective agent in the cell growth medium. There 
may be more than one signal producing system, and the system 
may include more than one reporter gene. 

The sensitivity of the system may be adjusted by, e.g., 
use of competitive inhibitors of any step in the activation 
or signal production process, increasing or decreasing the 
number of operators, using a stronger or weaker DBD or TAD, 
etc . 

When the signal is the death or survival of the cell in 
question, or proliferation or . non-proliferation of the cell 
in question, the assay is said to be a selection. When the 
signal merely results in a detectable phenotype by which the 
signaling cell may be differentiated from the same cell in a 
nonsignaling state (either way being a living cell) , the 
assay is a screen. However, the term "screening assay" may 
be used in a broader sense to include a selection. When the 
narrower sense is intended, we will use the term 
"nonselective screen" . 

Various screening and selection systems are discussed 
in Ladner, USP 5,198,346. 

Screening and selection may be for or against the 
peptide: target protein or compound : target protein 
interaction. 

Preferred assay cells are microbial (bacterial, yeast, 
algal, protozooal), invertebrate, vertebrate (esp. 
mammalian, particularly human) . The best developed two- 
hybrid assays are yeast and mammalian systems. 

Normally, two hybrid assays are used to determine 
whether a protein X and a protein Y interact, by virtue of 
their ability to reconstitute the interaction of the DBD and 
the TAD. However, augmented two-hybrid assays have been 
used to detect interactions that depend on a third, non- 
protein ligand. 

For more guidance on two-hybrid assays, see Brent and 
Finley, Jr., Ann. Rev. Genet., 31:663-704 (1997); Fremont- 
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Racine, et al . , Nature Genetics, 277-281 (16 July 1997); 
Allen, et al . , TIBS, 511-16 (Dec. 1995); LeCrenier, et al . , 
BioEssays, 20:1-6 (1998); Xu, et al., Proc . Nat. Acad. sci. 
(USA), 94:12473-8 (Nov. 1992); Esotak, et al . , Mol . Cell. 
Biol., 15:5820-9 (1995); Yang, et al . , Nucleic Acids Res., 
23:1152-6 (1995); Bendixen, et al . , Nucleic Acids Res., 
22:1778-9 (1994); Fuller, et al . , BioTechniques, 25:85-92 
(July 1998); Cohen, et al . , PNAS (USA) 95:14272-7 (1998); 
Kolonin and Finley, Jr., PNAS (USA) 95:14266-71 (1998). See 
also Vasavada, et al . , PNAS (USA), 88:10686-90 (1991) 
(contingent replication assay), and Rehrauer, et al . , J. 
Biol. Chem. , 271:23865-73 91996) (LexA repressor cleavage 
assay) . 

Two-Hybrid Systems : reporter Enzyme type 

In another embodiment, the components A and B 
reconstitute an enzyme which is not a transcription factor. 

As in the last example, the effect of the 
reconstitution of the enzyme is a phenotypic change which 
may be a screenable change, a selectable change, or both. 

In vivo Diagnostic Uses 

Radio- labeled ABM may be administered to the human or 
animal subject. Administration is typically by injection, 
e.g., intravenous or arterial or other means of 
administration in a quantity sufficient to permit subsequent 
dynamic and/or static imaging using suitable radio-detecting 
devices. The dosage is the smallest amount capable of 
providing a diagnostically effective image, and may be 
determined by means conventional in the art, using known 
radio- imaging agents as a guide. 

Typically, the imaging is carried out on the whole body 
of the subject, or on that portion of the body or organ 
relevant to the condition or disease under study. The 
amount of radio- labeled ABM accumulated at a given point in 
time in relevant target organs can then be quantified. 

A particularly suitable radio-detecting device is a 
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scintillation camera, such as a gamma camera. A 
scintillation camera is a stationary device that can be used 
to image distribution of radio- labeled ABM. The detection 
device in the camera senses the radioactive decay, the 
distribution of which can be recorded. Data produced by the 
imaging system can be digitized. The digitized information 
can be analyzed over time discontinuously or continuously. 
The digitized data can be processed to produce images, 
called frames, of the pattern of uptake of the radio-labeled 
ABM in the target organ at a discrete point in time. In 
most continuous (dynamic) studies, quantitative data is ■ 
obtained by observing changes in distributions of 
radioactive decay in target organs over time. In other 
words, a time-activity analysis of the data will illustrate 
uptake through clearance of the radio- labeled binding 
protein by the target organs with time. 

Various factors should be taken into consideration in 
selecting an appropriate radioisotope. The radioisotope 
must be selected with a view to obtaining good quality 
resolution upon imaging, should be safe for diagnostic use 
in humans and animals, and should preferably have a short 
physical half-life so as to decrease, the amount of radiation 
received by the body. The radioisotope used should 
preferably be pharmacologically inert, and, in the 
quantities administered, should not have any substantial 
physiological effect. 

The ABM may be radio- labeled with different isotopes of 
iodine, for example 123 I, 125 T , or 131 I (see for example, U.S. 
Patent 4,609,725). The extent of radio-labeling must, 
however be monitored, since it will affect the calculations 
made based on the imaging results (i.e. a diiodinated ABM 
will result in twice the radiation count of a similar 
monoiodinated ABM over the same time frame) . 

In applications to human subjects, it may be desirable 
to use radioisotopes other than 125 I for labeling in order to 
decrease the total dosimetry exposure of the human body and 
to optimize the detectability of the labeled molecule 
(though this radioisotope can be used if circumstances 
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require) . Ready availability for clinical use is also a 
factor. Accordingly, for human applications, preferred 
radio- labels are for example, 99m Tc, 67 Ga, 68 Ga, 90 Y, lu in, 
113m In, 123 I, 18S Re, 188 Re or 211 At . 
5 The radio- labeled ABM may be prepared by various 

methods. These include radio-halogenation by the chloramine 
- T method or the lactoperoxidase method and subsequent 
purification by HPLC (high pressure liquid chromatography) , 
for example as described by J. Gutkowska et al in 

10 "Endocrinology and Metabolism Clinics of America: (198 7) 16 
(1) :183. Other known methods of radio- labeling can be used, 
such as IODOBEADS™. 

There are a number of different methods of delivering 
the radio- labeled ABM to the end-user. It may be 

15 administered by any means that enables the active agent to 
reach the agent's site of action in the body of a mammal. 
Because proteins are subject to being digested when 
administered orally, parenteral administration, i.e., 
intravenous, subcutaneous, intramuscular, would ordinarily 

2 0 be used to optimize absorption of an ABM, such as an 
antibody, which is a protein. 



EXAMPLES 
2 5 Animal Models. 

Obesity and subsequent hyperinsulinemia and 
hyperglycemia were induced by feeding a group of 3 week old 
mice (50 C57B1/6 males) a high-fat diet (Bio-Serve, 
Frenchtown, NJ, #F185 0 High Carbohydrate -High Fat) . Another 

30 group of 3 week old mice (20 C57B1/6 males) were fed the 
normal control diet (PMI Nutrition International Inc., 
Brentwood, MO, Prolab RMH3 00 0) . The mice were placed onto 
the respective diets immediately following weaning. Animal 
weights were determined weekly. Fasting blood-glucose and 

35 plasma insulin measurements were determined after 2,4, 8 
and 16 weeks, and 6 months, on the respective diets. 

Normal weight, normal fasting blood glucose and normal 
fasting plasma insulin levels are defined as the respective 
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mean values of the animals fed the control diet. 

Two of the "most typical" animals were selected for 
each group (Control, hyper insulinemic and Diabetic) at each 
time point ( 2,4, 8, and 16 weeks, and 6 months, after 
commencement of diet) for sacrifice. The selected mice were 
sacrificed and liver tissue obtained and frozen in liquid 
notrogen until processed for RNA isolation. 

Fasting Blood Glucose Levels. 

Blood glucose levels was measured from a drop of blood 
taken from the tip of the tail of fasted (6 hr) mice using a 
Lifescan Genuine One Touch glucometer. All measurements 
occurred between 3:00 pm and 5:00 pm. 

Plasma insulin measurements. 

Blood was collected from the tail of fasted (6hr) mice 
into a heparinized capillary tube and stored on ice. All 
collections occurred between 3:00 pm and 5:00 pm. Plasma 
was separated from red blood cells by centrif ugat ion for 10 
minutes at 8000 x g and then stored at -20 'C. Insulin 
concentrations were determined using the Rat Insulin ELISA 
kit and rat insulin standards (ALPCO) essentially as 
instructed by the manufacturer. Values were adjusted by a 
factor of 1.2 3 as determined by the manufacturer to correct 
for the species difference in cross-reactivity with the 
ant ibody . 

RNA isolation. 

Total RNA was isolated from livers using the RNA STAT- 
60 Total RNA/mRNA Isolation Reagent according to the 
manufacturer's instructions (Tel-Test, Friendswood, TX) . 

Sample Quantification and Quality Assessment 

Total RNA was quantified and assessed for quality on a 
Bioanalyzer RNA 60 00 Nano chip (Agilent) . Each chip 
contained an interconnected set of gel -filled channels that 
allowed for molecular sieving of nucleic acids. Pin- 
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non- over lapping and unrelated. Combinations of the above are 
also possible. 

In addition to the classes stated, the corresponding human 
5 gene clusters are also of interest. These may be obtained in 
a number of ways . First , one may search on Unigene 
(http: //www.ncbi .nlm.nih.gov/entrez/query. f cgi?db=unigene) 
for the identified human protein. Review the "hits" (each 
of which is a Unigene record) for those prefixed by "Hs . " 

10 Secondly, one may access the Unigene record for the mouse 
gene cluster (which is given in Master Table 1) ,' and then 
click on "Homologene" . This will bring up a new page which 
includes the section "Possible Homologous Genes" . One of 
the entries should be a Homo sapiens gene (considered by 

15 Unigene to be the most related human gene) ; click on its 
Unigene record link. 

Additional information of interest may be accessed by 
searching with the mouse gene accession # in the Mouse Gene 
Informatics database, at http://www.informatics.jax.org/. 



20 
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unnamed protein product 


RAR-related orphan receptor A, isoform a; RAR-related orphan receptor alpha; 
retinoic acid receptor-related orphan receptor alpha; transcription factor RZR-alpha; 
ROR-alpha 


RAR-related orphan receptor A, isoform c; RAR-related orphan receptor alpha; 
retinoic acid receptor-related orphan receptor alpha; transcription factor RZR-alpha; 
ROR-alpha 


RAR-related orphan receptor A, isoform b; RAR-related orphan receptor alpha; 
retinoic acid receptor-related orphan receptor alpha; transcription factor RZR-alpha; 
ROR-alpha 


RAR-related orphan receptor A, isoform d; RAR-related orphan receptor alpha; 
retinoic acid receptor-related orphan receptor alpha; transcription factor RZR-alpha; 
ROR-alpha 


| retinoid-related orphan receptor RZR-alpha - human | 


RAR-related orphan receptor B; RAR-related orphan receptor beta; retinoic 
acid-binding receptor beta; nuclear receptor RZR-beta 


|bA133M9.1 (RAR-related orphan receptor B) ! 


INUCLEAR RECEPTOR ROR-BETA (NUCLEAR RECEPTOR RZR-BETA) ! 


tumor necrosis factor (ligand) superfamily, member 10; Apo-2 ligand; TNF-related 
apoptosis inducing ligand TRAIL 


n A, Crystal Structure Of Apo2ITRAIL 


n B, Crystal Structure Of Trail-Dr5 Complex 


in D, Crystal Structure Of Trail-Sdi 5 
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Chai 


Chai 


Chai 


BAB71208.1 | 


o 

ON 


NP 002934.1 


NP_599022.1 


NP_599024.1 


IA56856 


NP_008845.2 


|CAD13276.1 ! 


IQ92753 


NP_OO3801.1 


ON 


1 6435529 | 


1 10835510 


NP 699169 






R(HI-D) 
-2.02 
















F:(HI-D) 
-10.21 








F:(C-HI)- 
2.45 






Mm.8858 
















Mm. 1062 








Mm.23336 






NM_0 13646 
NP_038674.1 
















NM_009425 
NP 033451.1 








AK018485 
2204249A 





IS) O LT) 



WO 2004/092416 



PCT/US2004/010191 



"a "5* -i 



< ^ 



CO ^ 
CO JO 



= CD 

0 ■ 

1 5 



6^6 



^ CO 



CD g W 

CO £ g 

q3 iS o 

I - 1 

m ^ "5) 



£ E 



Cl O 
CD q. 

E o 



2 c > 

2 <D CN 



■V 5 x § 

co a 9 ro 

CD cm" "5 9 s 

c I— ^ I— 

.2 CO ^ CO 

£ O co CD 



o ^ 

3, | 



WO 2004/092416 



PCT/US2O04/010191 



o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


1 1.00e-169| 


CD 
CO 


CD 
CO 


CD 
CO 


CD 
CM 
(*- 


a> 

CSJ 


co 


CD 
CSJ 




CD 
O 

h- 


o 


CD 
O 


CD 
O 


o 


o 


CD 
CD 
CD 


CM 
CD 
CD 


co 
cn 
in 


cytochrome P-450 ! 


hrome P450, subfamily IIC, polypeptide 9; cytochrome P450, subfamily IIC 
(mephenytoin 4-hydroxylase), polypeptide 10; mephenytoin 4-hydroxylase; 
microsomal monooxygenase; xenobiotic monooxygenase; flavoprotein-linked 
monooxygenase 


cytochrome P-450 | 


Cytochrome P450 2C10 (CYPIIC10) (P450 MP-8) (S-mephenytoin 4-hydroxylase) 
(P-450MP) 


cytochrome P-450 S-mephenytoin 4-hydroxylase j 


cytochrome P450 I 


cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), polypeptide 19; 
mephenytoin 4'-hydroxylase; microsomal monooxygenase; xenobiotic 
monooxygenase; flavoprotein-linked monooxygenase 


| S-mephenytoin 4'-hydroxyIase (EC 1.14.14.-) cytochrome P450 2C19 - human j 


Cytochrome P450 2C8 (CYPIIC8) (P450 form 1) (P450 MP-12/MP-20) (P450 IIC2) 
(S-mephenytoin 4-hydroxylase) 


| Unknown (protein for MGC:22146) I 


|cytochrome P-450 S-mephenytoin 4-hydroxylase I 


cytochrome P450, subfamily IIC, polypeptide 8 isoform 1; mephenytoin I 
4-hydroxylase; microsomal monooxygenase; xenobiotic monooxygenase; 
flavoprotein-linked monooxygenase; P450 form 1 


|cytochrome P450 2C8 - human 


cytochrome P450 arachidonic acid epoxygenase isoform, Cyp 2C8 [human, kidney, 
Peptide Partial, 485 aa] 


(cytochrome P-450 S-mephenytoin 4-hydroxylase 
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j chrome P450 2C17 - human (fragment) 
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cytochrome P450, subfamily IIIA, polypeptide 5; niphedipine oxidase; aryl 
hydrocarbon hydroxylase; xenobiotic monooxygenase; microsomal 
monooxygenase; flavoprotein-linked monooxygenase 


cytochrome P450, subfamily IIIA, polypeptide 4; nifedipine oxidase; P450-III, steroid 
inducible; glucocorticoid-inducible P450; cytochrome P450, subfamily IIIA 
(niphedipine oxidase), polypeptide 3 


Cytochrome P450 3A4 (Quinine 3-monooxygenase) (CYPIIIA4) (Nifedipine 
oxidase) (NF-25) (P450-PCN1) 


|cytochrome P450, subfamily IIIA (niphedipine oxidase), polypeptide 3 | 


| cytochrome P-450 nifedipine oxidase | 


| cytochrome P450-3A4 | 


| cytochrome P450 nifedipine oxidase | 


cytochrome P450, subfamily IIIA, polypeptide 7; aryl hydrocarbon hydroxylase; 
microsomal monooxygenase; xenobiotic monooxygenase; flavoprotein-linked 
monooxygenase 


| cytochrome P450 variant 3A7 | 


; cytochrome P450 polypeptide 43 | 


; cytochrome P450 polypeptide 43 | 


; cytochrome P450 polypeptide 43 I 


(cytochrome P450 subfamily IIIA polypeptide 43 I 


(thromboxane synthase j 


(thromboxane synthase ! 


(thromboxane synthase j 


(thromboxane synthase j 


(thromboxane synthase | 


thromboxane A synthase 1 (platelet, cytochrome P450, subfamily V), isoform 
TXS-I; TXA synthase 


thromboxane synthase j 


|Thromboxane-A synthase (TXA synthase) (TXS) | 


isoform 2; 


isoform 1 ; 


isoform 3; 
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Similar to insulin-like growth factor binding protein 1 


IGF-binding preprotein (AA -25 to 234) ] 


| insulin-like growth factor binding protein 1 1 


insulin-like growth factor binding protein 1 j 


[small IGF-binding-protein I 


asparagine synthetase; glutamine-dependent asparagine synthetase; TS1 1 cell 
cycle control protein 


Asparagine synthetase [glutamine-hydrolyzing] (Glutamine-dependent asparagine 
synthetase) (TS11 cell cycle control protein) 


|aspartate-ammonia ligase (EC 6.3.1.1) - human j 


similar to asparagine synthetase; glutamine-dependent asparagine synthetase; 
TS1 1 cell cycle control protein 


Chain A, Ligand-Free Human Glutathione S-Transferase M1a-1a 


similar to Glutathione S-transferase Mu 1 (GSTMM) (HB subunit 4) (GTH4) 
(GSTM1 a-1 a) (GSTM1 b-1 b) (GST class-mu 1 ) 


[glutathione transferase M1 j 


Chain A, Ligand-Free Human Glutathione S-Transferase M2-2 (E.C.2.5.1.18), 
Monodinic Crystal Form 
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interleukin 1 , beta 


lnterleukin-1 beta precursor (IL-1 beta) (Catabolin) I 


j interleukin 1 I 


| interleukin 1 beta | 


| Chain , lnterleukin-1 Beta From Joint X-Ray And Nmr Refinement j 


|Chain , lnterleukin-1 Beta (IL-1 Beta) (Mutant With Cys 8 Replaced By Ala (C8A) j 


| Chain , lnterleukin-1 Beta (Human) Mutant With Thr 9 Replaced By Gly (T9g) \ 


| Chain , lnterleukin-1 Beta (IL-1 Beta) (Mutant With Cys 71 Replaced By Ala) (C71A) 


| Chain , lnterleukin-1 Beta (IL-1 Beta) (Mutant With Cys 71 Replaced By Ser) (C71S) 


6-phosphofructo-2-kinase/fructose-2,6-biphosphatase 


|6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase 


6-phosphofructo-2-kinase (EC 2.7.1.105) / fructose-2, 6-bisphosphate 
2-phosphatase (EC 3.1.3.46) 


(inducible 6-phosphofructo-2-kinase/fructose 2,6-bisphosphatase j 
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6-phosphofructo-2-kinase/fructose-2,6-biphosphatase2 (6PF-2-K/Fru-2,6-P2ASE 
heart-type isozyme) (PFK-2/FBPase-2) [Includes: 6-phosphofructo-2-kinase ; 
Fructose-2,6-bisphosphatase ] 


6-phosphofructo-2-kinase/fructose-2,6-biphosphatase 2; 
Fructose-2,6-bisphosphatase, cardiac isozyme 


|6-phosphofructo-2-kinase heart isoform 


|6-phosphofructo-2-kinase/fructose-2,6-biphosphatase 4 | 


6-phosphofructo-2-kinase (EC 2.7.1.1 05) / fructose-2, 6-bisphosphate 
2-phosphatase (EC 3.1.3.46) 


6-phosphofructo-2-kinase/fructose-2,6-biphosphatase 1 ; 
Fructose-2,6-bisphosphatase 
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|EYA2 homolog 1 


cystathionase isoform 1; cystathionine gamma-lyase; homoserine deaminase; 
homoserine dehydratase; cysteine desulfhydrase 


(Cystathionine gamma-lyase 


|bA4201 5.1 .2 {cystathionase (cystathionine gamma-lyase)) 


(cystathionine gamma-lyase (EC 4.4.1 .1 ) 


serine (or cysteine) proteinase inhibitor, clade B (ovalbumin), member 1; protease 
inhibitor 2 (anti-elastase), monocyte/neutrophil; protease inhibitor 2 (anti-elastase), 
monocyte/neutrophil derived 


serine (or cysteine) proteinase inhibitor, clade B (ovalbumin), member 9; protease 
inhibitor 9 (ovalbumin type) 


serine (or cysteine) proteinase inhibitor, clade B (ovalbumin), member 8; protease 
inhibitor 8 (ovalbumin type) 


serine (or cysteine) proteinase inhibitor, clade B (ovalbumin), member 10; protease 
inhibitor 10 (ovalbumin type, bomapin) 


serine (or cysteine) proteinase inhibitor, clade B (ovalbumin), member 6; protease 
inhibitor 6 (placental thrombin inhibitor) 


Chain A, Human Plasminogen Activator lnhibitor-2.[loop (66-98) Deletionmutant] 
Complexed With Peptide Mimicking The Reactive Center Loop 


| placental thrombin inhibitor - human 
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similar to Glutathione S-transferase theta 1 (GST class-theta) (Glutathione 

transferase T1-1) 

glutathione S-transferase theta 1 

glutathione S-transferase theta 1 

similar to data source:SPTR, source key:Q60928, evidence:ISS~putative~similarto 


GAMMA-GLUTAMYLTRANSPEPTIDASE PRECURSOR (EC 2.3.2.2) (GAMMA- 
GLUTAMYLTRANSFERASE) (GGT) 
hypothetical protein FLJ90165 
nicotinamide N-methyltransferase 

thioether S-methyltransferase-like; similar to P40936 (PID:g731019) 


Indolethylamine N-methyltransferase (Aromatic alkylamine N-methyltransferase) 
(Indolamine N-methyltransferase) (Arylamine N-methyltransferase) (Amine 
N-methyltransferase) 

indolethylamine N-methyltransferase; thioester S-methyltransferase-like 


Unknown (protein for IMAGE:520921 8) ] 
ZNF228 protein 

similar to ZNF228 protein 
zinc finger protein 228 


Zinc finger protein 226 
zinc finger protein 226 

zinc finger protein 93 homolog; zinc finger protein homologous to mouse Zfp93; 
zinc finger protein homologous to Zfp93 in mouse; zinc finger protein 93 homolog 
(mouse) 

Hypothetical zinc finger-like protein 
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| similar td'Zinc finger protein 229 


| zinc finger protein 


zinc finger protein 45 (a Kruppel-associated box (KRAB) domain polypeptide); Zinc 
finger protein-45 (a Kruppel-associated box (KRAB) domain 


|Zinc finger protein ZNF45 


hydroxy-delta-5-steroid dehydrogenase, 3 beta- and steroid delta-isomerase 1 ; 
Hydroxy-delta-5-steroid dehydrogenase, 3 beta- and steroid 


1 3-beta-hydroxysteroid dehydrogenase/delta-5-delta-4-isomerase 


hydroxy-delta-5-steroid dehydrogenase, 3 beta- and steroid delta-isomerase 2; 
Hydroxy-delta-5-steroid dehydrogenase, 3 beta- and steroid 


1 3-beta-hydroxysteroid dehydrogenase gene I 


dJ871G17.4 (novel 3-beta hydroxysteroid dehydrogenase/isomerase family 
member) ! 


1 3-beta-hydroxysteroid dehydrogenase | 


similar to dJ871G17.4 (novel 3-beta hydroxysteroid dehydrogenase/isomerase 
family member) 


| similar to 3-beta-hydroxysteroid dehydrogenase | 


similar to 3 BETA-HYDROXYSTEROID DEHYDROGENASE/DELTA 
5-->4-ISOMERASE (3BETA-HSD) 


1 3 beta-hydroxy-delta 5-C27-steroid oxidoreductase I 


1 3 beta-hydroxy-delta 5-C27-steroid oxidoreductase j 


similar to dJ871G17.6 (novel 3-beta hydroxysteroid dehydrogenase/isomerase 
family member ) 


dJ871G17.6 (novel 3-beta hydroxysteroid dehydrogenase/isomerase family 
member ) 


3 beta-hydroxysteroid dehydrogenase homolog pseudogene f 
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cytochrome P450, subfamily IlC (mephenytoin 4-hydroxylase), polypeptide 18; 
cytochrome P450, subfamily IlC (mephenytoin 4-hydroxylase), polypeptide 17; 
microsomal monooxygenase; flavoprotein-linked monooxygenase 


|Cytochrome P450 2C18 (CYPIIC18) (P450-6B/29C) ] 


cytochrome P450, subfamily IlC (mephenytoin 4-hydroxylase), polypeptide 19; 
mephenytoin 4'-hydroxylase; microsomal monooxygenase; xenobiotic 
monooxygenase; flavoprotein-linked monooxygenase 


| cytochrome P-450 ) 


cytochrome P450, subfamily IlC, polypeptide 9; cytochrome P450, subfamily IlC 
(mephenytoin 4-hydroxylase), polypeptide 10; mephenytoin 4-hydroxylase; 
microsomal monooxygenase; xenobiotic monooxygenase; flavoprotein-linked 
monooxygenase 


| cytochrome P-450 j 


| S-mephenytoin 4'-hydroxylase (EC 1.14.14.-) cytochrome P450 2C1 9 - human | 


| cytochrome P450 | 


Cytochrome P450 2C10 (CYPIIC10) (P450 MP-8) (S-mephenytoin 4-hydroxylase) 
(P-450MP) 


| cytochrome P-450 S-mephenytoin 4-hydroxylase j 


[cytochrome P450 - human | 


Cytochrome P450 2C8 (CYPIIC8) (P450 form 1) (P450 MP-12/MP-20) (P450 IIC2) 
(S-mephenytoin 4-hydroxylase) 


aldehyde dehydrogenase (NAD) (EC 1.2.1.3) 5 precursor, mitochondrial 


bA1 1 3024.2 (aldehyde dehydrogenase 1 family, member B1 (ALDH5 ALDHX)) j 


laldehyde dehydrogenase 1 family, member B1 ; aldehyde dehydrogenase 5 | 


|Aldehyde dehydrogenase X, mitochondrial precursor (ALDH class 2) | 


NP_000763.1 


|P33260 | 


NP_000760.1 


|BAA00123.1 | 
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|AAB23864.2 | 


|F38462 j 


|1506290A j 
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|AAA52157.1 | 
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similar to Aldehyde dehydrogenase, mitochondrial precursor (ALDH class 2) 
(ALDHI) (ALDH-E2) 


Chain A, Human Mitochondrial Aldehyde Dehydrogenase Complexed With Nad+ 
And Mn2+ 


aldehyde dehydrogenase 1 


aldehyde dehydrogenase 2 family (mitochondrial); aldehyde dehydrogenase 2, 
mitochondrial 


i precursor polypeptide (AA -36 to 479) j 


Aldehyde dehydrogenase 1A2 (Retinaldehyde-specific dehydrogenase type 2) 
(RALDH(II)) (RALDH-2) 


aldehyde dehydrogenase 1 family, member A2; retinaldehyde dehydrogenase 2 j 


| aldehyde dehydrogenase 1 


aldehyde dehydrogenase 1A1; aldehyde dehydrogenase 1 , soluble; aldehyde 
dehydrogenase, liver cytosolic; ALDH class 1; acetaldehyde dehydrogenase 1; 
retinal dehydrogenase 1 


| aldehyde dehydrogenase 1A3; aldehyde dehydrogenase 6 I 


| aldehyde dehydrogenase 2 (EC 1.2.1.3) | 


|Similar to aldehyde dehydrogenase 1 family, member A2 


RALDH2-T 


|aldehyde dehydrogenase I - human (fragment). 


|formyltetrahydrofolate dehydrogenase isoform a 


| similar to 10-formyltetrahydrofolate dehydrogenase 


1 10-formyltetrahydrofolate dehydrogenase (10-FTHFDH) 


Similar to RIKEN cDNA 061 0025L1 5 gene 


| protein expressed in thyroid 


| protein expressed in thyroid 
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histidase 


| histidine ammonia-lyase; Histidine ammonia-lyase (histidase) j 


putative N-acetyltransferase Camello 2 


N-acetyltransferase 8; kidney- and liver-specific gene product; kidney- and 
liver-specific gene 


IGLA | 


| kidney- and liver-specific gene j 


[hypothetical protein TSC501 | 


phosphodiesterase 3A 


cGMP-inhibited 3',5-cyclic phosphodiesterase A (Cyclic GMP inhibited 
phosphodiesterase A) (CGI-PDE A) 


(phosphodiesterase 3A, cGMP-inhibited 


|cGMP-inhibited cAMP phosphodiesterase (EC 3.1 .4.-), myocardial form - human 


| cyclic nucleotide phosphodiesterase | 


(phosphodiesterase 3B, cGMP-inhibited j 


similar to RIKEN cDNA 231 0032D1 6 


KIAA1 434 protein j 


unnamed protein product j 


BAB61 863.1 


|NP_002099.1 j 


NP_057431.1 


NP_003951.2 


|BAA71643.1 | 


|AAH12626.1 


|T44342 ! 


CAA06304.1 


Q14432 


|NP_000912.2 I 


|A44093 ! 


|CAA64774.1 | 


|NP_000913.1 1 


XP_045585.1 


BAA92672.1 | 


| BAA9 1994.1 | 
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UDP-glucose dehydrogenase 


uridine diphosphoglucose dehydrogenase (EC 1 .-.-.-) j 


UDP glucose 6-dehydrogenase | 


uridine diphospho-glucose dehydrogenase | 


uridine diphospho-glucose dehydrogenase i 


leukemia inhibitory factor receptor precursor 


i leukaemia inhibitorv factor receptor, LIF receptor [human, placenta, Peptide, 1078 I 


laal I 


(oncostatin M receptor j 


| leukemia inhibitory factor receptor I 


glucose-6-phosphatase, catalytic 


| Unknown (protein for MGC.22459) 


| islet-specific glucose-6-phosphatase catalytic subunit-related p 


hypothetical protein dJ726C3.2 [ 


| hypothetical protein dJ726C3.2 
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hypothetical protein dJ726C3 .2 [ 

hypothetical protein dJ726C3 .2 


ADP-ribosylation factor 4-like 
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similar to ADP-ribosylation-like 4 
ADP ribosylation factor-like protein 

malate dehydrogenase (oxaloacetate-decarboxylating) (NADP) (EC 1 .1 .1 .40) 
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jactivin A type IB receptor, isoform b precursor; serine(threonine) protein kinase 


[ activin type 1 receptor SKR2, splice form 2 


Jactivin A type IB receptor, isoform c precursor; serine(threonine) protein kinase 


| activin type 1 receptor SKR2 splice form 3 


transforming growth factor, beta receptor 1 (activin A receptor type ll-like kinase, 
53kDa); transforming growth factor, beta receptor 1 (activin A receptor type ll-like 
kinase, 53kD) 


| similar to activin receptor-like kinase 7 


jactivin receptor-like kinase 7 


Chain A, Cytoplasmic Domain Of Unphosphorylated Type I Tgf-Beta Receptor 
Crystallized Without Fkbpl 2 


bone morphogenetic protein receptor, type IB; serine/threonine receptor kinase 


bone morphogenetic protein receptor, type IA precursor; activin A receptor, type 
ll-like kinase 3 


jactivin type I receptor SKR2-1 j 


activin A type I receptor precursor; hydroxyalkyl-protein kinase; activin A receptor, 
type ll-like kinase 2 


Serine/threonine-protein kinase receptor R3 precursor (SKR3) (Activin receptor-like 
kinase 1 ) (ALK-1 ) (TGF-B superfamily receptor type I) (TSR-I) 


activin A receptor type ll-like 1 ; Activin A receptor, type ll-like kinase 1 I 


activin type I receptor SKR2-2 j 


'activin type I receptor SKR2-3 - human j 


transforming growth factor beta receptor type IlB precursor | 


stearoyl-CoA desaturase (delta-9-desaturase) 


|NP_064732.1 
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Acyl-CoA desaturase (Stearoyl-CoA desaturase) (Fatty acid desaturase) 
(Delta(9)-desaturase) 


| Unknown (protein for MGC: 10264) 


| stearoyi CoA desaturase 


|PRO0998 


| Unknown (protein for MGC:10270) 


|stearoyl-CoA desaturase - human (fragment) 


| hypothetical protein 


Cytochrome P450 7A1 (Cholesterol 7-alpha-monooxygenase) (CYPVII) 
(Cholesterol 7-alpha-hydroxylase) 


cytochrome P450, subfamily VIIA, polypeptide 1 ; cholesterol 7-alpha-hydroxylase; 
cholesterol 7 alpha-monooxygenase 


| oxysterol 7alpha-hydroxylase | 


(cytochrome P450, subfamily VIIB, polypeptide 1 ; oxysterol 7alpha-hydroxylase I 


cytochrome P450, subfamily VIIIB, polypeptide 1; 7 
alpha-hydroxy-4-cholesten-3-one 12-alpha-hydroxylase; sterol 12-alDha- 


Isterol 12-alpha hydroxylase CYP8B1 ! 


CYP7 n | 


calcyon 


calcyon 
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gadd45-related protein 


growth arrest and DNA-damage-inducible, gamma; GADD45-gamma; gadd-related 
protein, 17 kD 


| growth arrest and DNA damage inducible protein gamma 


similar to Nucleolar GTP-binding protein 1 (Chronic renal failure gene protein) 
(GTP-binding protein NGB) 


| unnamed protein product 


|G protein-binding protein CRFG; GTP-binding protein 


|G protein-binding protein CRFG 


| putative G-binding protein 


cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), polypeptide 18; 
cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), polypeptide 17; 
microsomal monooxygenase; flavoprotein-linked monooxygenase 


|Cytochrome P450 2C18 (CYPIIC18) (P450-6B/29C) | 


cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), polypeptide 19; 
mephenytoin 4'-hydroxylase; microsomal monooxygenase; xenobiotic 
monooxygenase; flavoprotein-linked monooxygenase 


|cytochrome P-450 | 


[cytochrome P-450 j 


cytochrome P450, subfamily IIC, polypeptide 9; cytochrome P450, subfamily IIC 
(mephenytoin 4-hydroxylase), polypeptide 10; mephenytoin 4-hydroxylase; 
microsomal monooxygenase; xenobiotic monooxygenase; flavoprotein-linked 
monooxygenase 


cytochrome P450 


Cytochrome P450 2C10 (CYPIIC10) (P450 MP-8) (S-mephenytoin 4-hydroxylase) 
(P-450MP) 


S-mephenytoin 4'-hydroxylase (EC 1 .14.14.-) cytochrome P450 2C19 - human | 
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cytochrome P450 - human I 


Cytochrome P450 2C8 (CYPIIC8) (P450 form 1) (P450 MP-12/MP-20) (P4bU nuzj 
(S-mephenytoin 4-hydroxytase) mmmamtm 
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i cytochrome P-450 S-mephenytoin 4-hydroxylase 1 


jcytochrome P450 2C8 - human 1 


cytochrome P450 arachidonic acid epoxygenase isoform, Cyp 2C8 [human, kidney, 
Pfiptirte Partial, 485 aal 


cytochrome P-450 S-mephenytoin 4-hydroxylase 1 


1 cytochrome P-450 S-mephenytoin 4-hydroxylase j 


hypothetical protein BC0 1 5 1 48 


MER receptor tyrosine kinase 


I c-mer proto-oncogene tyrosine kinase i 


jtransforming protein (axl(-)) - human 


AXL receptor tyrosine kinase isoform 2; AXL transforming sequence/gene; 
oncogene AXL 


| Unknown (protein for MGC:34202) 


AXL receptor tyrosine kinase isoform 1; AXL transforming sequence/gene; 
oncogene AXL 


iTyrosine-protein kinase receptor UFO precursor (AXL oncogene) 


j unnamed protein product ^^^^ 


Tyrosine-protein kinase receptor TYR03 precursor (Tyrosine-protein kinase RSE) 
(Tyrosine-protein kinase SKY) (Tyrosine-protein kinase DTK) (Protein-tyrosine 
kinase byk) 


152418 1 


P10632 


_ CM 
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4 °i 


|AAA52160.1 


IS66382 


AAB35292.1 


AAA52161.1 


AAA52159.1 


NP_620134.1 


AAG33 129.1 
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TYR03 protein tyrosine kinase; Brt; Dtk; Sky; Tif; Tyro3 protein tyrosine kinase 
(sea-related receptor tyrosine kinase) 


protein-tyrosine kinase I 


receptor tyrosine kinase - human I 


ZP3 receptor precursor - human 1 


Similar to TYR03 protein tyrosine kinase 


TYR03 


Epidermal growth factor receptor precursor (Receptor protein-tyrosine kinase 
ErbB-1) 


I aberrant epidermal growth factor receptor I 


epidermal growth factor receptor (erythroblastic leukemia viral (v-erb-b) oncogene 
homolog, avian); epidermal growth factor receptor (avian erythroblastic leukemia 
viral (v-erb-b) oncogene homolog); Epidermal growth factor receptor 


Ip1 10 epidermal growth factor receptor I 


| truncated epidermal growth factor receptor ! 


| EGF (1 is 2nd base in codon) 


| epidermal growth factor receptor 


I epidermal growth factor receptor precursor 


v-erb-a erythroblastic leukemia viral oncogene homolog 4; avian erythroblastic 
leukemia viral (v-erb-b2) oncogene homolog 4; v-erb-a avian erythroblastic 
leukemia viral oncogene homolog-like 4 


v-erb-b2 erythroblastic leukemia viral oncogene homolog 3 (avian); Transformation 
gene ERBB-3; v-erb-b2 avian erythroblastic leukemia viral oncogene homolog 3 


| kinase-related transforming protein (erbB3) (EC 2.7.1 .-) precursor - human 


Receptor protein-tyrosine kinase erbB-3 precursor (c-erbB3) (Tyrosine kinase-type 
cell surface receptor HER3) 


| Chain A, Structure Of The Her3 (Erbb3) Extracellular Domain 


NP_006284.1 


BAA21781.1 


138412 


A56379 


AAH29925.1 


CAA5 1396.1 


P00533 


AAA52371.1 
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IAAG35786.1 


AAG35790.1 


CAA25282.1 
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Receptor protein-tyrosine kinase erbB-2 precursor (p185erbB2) (NEU 
proto-oncogene) (C-erbB-2) (Tyrosine kinase-type cell surface receptor HER2) 
(MLN 19) 


v-erb-b2 erythroblastic leukemia viral oncogene homolog 2, neuro/glioblastoma 
derived oncogene homolog; Avian erythroblastic leukemia viral (v-erb-b2) oncogene 
homolog 2; v-erb-b2 avian erythroblastic leukemia viral oncogene homolog 2 
(neuro/glioblastoma derived oncogene homolog) 


Similar to v-erb-b2 avian erythroblastic leukemia viral oncogene homolog 3 f 


j herstatin j 


microsomal epoxide hydrolase 


epoxide hydrolase 1, microsomal (xenobiotic); Epoxide hydroxylase 1 , microsomal 
(xenobiotic) 


| epoxide hydrolase j 


| precusor polypeptide (AA -20 to 435) j 


| microsomal epoxide hydrolase (EC 3.3.2.3) 


aldeyde oxidase 1 


|Aldehyde oxidase 


| aldehyde oxidase 1 


Xanthine dehydrogenase/oxidase [Includes: Xanthine dehydrogenase (XD); 
Xanthine oxidase (XO) (Xanthine oxidoreductase)] 


J xanthine dehydrogenase 


|xanthene dehydrogenase; xanthine oxidase; xanthine dehydrogenase 


|similar to Xanthine dehydrogenase/oxidase 


similar to ALDEHYDE OXIDASE HQMOLOG-1-data source:SPTR, source 
key:Q9ESH4, evidence:ISS-putative 


P04626 


NP_004439.1 


IAAH02706.1 I 


IAAD56009.2 I 


AAC41 694.1 


NP 000111.1 


AAA52389.1 , 


ICAA68486.1 


|AAA59580.1 


BAB40305.1 


IQ06278 
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P47989 


JAAA75287.1 


|NP_000370.1 
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cytochrome P450, subfamily VIIIB, polypeptide 1; 7 
alpha-hydroxy-4-cholesten-3-one 12-alpha-hydroxylase; sterol 
1 2-alpha-hydroxylase 


[sterol 1 2-alpha hydroxylase CYP8B1 


| prostacyclin synthase 


| prostacyclin synthase 


(prostaglandin 12 (prostacyclin) synthase j 


(prostacyclin synthase i 


[prostacyclin synthase | 


[prostacyclin synthase I 


aldehyde dehydrogenase 1 


aldehyde dehydrogenase 1A1; aldehyde dehydrogenase 1, soluble; aldehyde 
dehydrogenase, liver cytosolic; ALDH class 1; acetaldehyde dehydrogenase 1; 
retinal dehydrogenase 1 


[aldehyde dehydrogenase 1 family, member A2; retinaldehyde dehydrogenase 2 I 


Aldehyde dehydrogenase 1 A2 (Retinaldehyde-specific dehydrogenase type 2) 
(RALDH(II)) (RALDH-2) 


| aldehyde dehydrogenase 1 A3; aldehyde dehydrogenase 6 ] 


similar to Aldehyde dehydrogenase, mitochondrial precursor (ALDH class 2) 
(ALDHI) (ALDH-E2) 


| aldehyde dehydrogenase 


Chain A, Human Mitochondrial Aldehyde Dehydrogenase Complexed With Nad+ 
And Mn2+ 


aldehyde dehydrogenase 2 family (mitochondrial); aldehyde dehydrogenase 2, 
mitochondrial 


precursor polypeptide (AA -36 to 479) j 


aldehyde dehydrogenase (NAD) (EC 1 .2.1 .3) 5 precursor, mitochondrial j 


aldehyde dehydrogenase 1 family, member B1 ; aldehyde dehydrogenase 5 ] 


|bA1 13024.2 (aldehyde dehydrogenase 1 family, member B1 (ALDH5 ALDHX)) | 
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|AAC63037.1 


|AAG31784.1 I 


|BAA28219.1 I 


|NP_000952.1 I 


|BAA11910.1 ; 


|AAG31785.1 [ 


|AAG31783.1 


AAC51652.1 


NP_000680.2 


|NP_003879.1 I 


094788 


|NP_000684.1 j 


XP_007012.1 


|AAA51693.1 | 


6137677 


NP 000681.1 


CAA68290.1 | 


A40872 j 


NP_000683.2 | 


CAD13246.1 | 
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Aldehyde dehydrogenase X, mitochondrial precursor (ALDH class 2) j 


RALDH2-T j 


Similar to aldehyde dehydrogenase 1 family, member A2 j 


aldehyde dehydrogenase I - human (fragment). J 


aldehyde dehydrogenase 2 (EC 1 .2.1 .3) j 


formyltetrahydrofolate dehydrogenase isoform a j 


10-formyltetrahydrofolate dehydrogenase (10-FTHFDH) | 


similar to 1 0-formyltetrahydrofolate dehydrogenase ; 


cytokine receptor related protein 4 


cytokine receptor-like factor 3 


cytokine receptor-like molecule 9 


similar to cytokine receptor-like molecule 9 


aquaporin 8 


aquaporin 8 i 


enoyl-Coenzyme A, hydratase/3-hydroxyacyl Coenzyme A dehydrogenase 


3-hydroxyacyl-CoA dehydrogenase; peroxisomal enoyl-CoA hydratase ! 


solute carrier family 39 (zinc transporter), member 4 


solute carrier family 39 (zinc transporter), member 4 | 


E-cadherin 


P30837 | 


BAA34786.1 | 


AAH30589.1 | 


139431 | 


AAB59500.1 | 


NP_036322.2 


075891 ! 
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AAD02422.1 


AAH23567.1 | 


NP 057070.1 j 


XP_065910.1 
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cadherin 1 , type 1 preproprotein; calcium-dependent adhesion protein, epithelial; 
cadherin 1, E-cadherin (epithelial); uvomorulin; cell-CAM 120/80; Arc-1 


| E-cadherin 


| E-cadherin j 


| uvomorulin j 


| E-cadherin I 


|Cadherin-3 precursor (Placental-cadherin) (P-cadherin) i 


cadherin 3, type 1 preproprotein; P-cadherin; placental cadherin; cadherin 3, 
P-cadherin (placental); calcium-dependent adhesion protein, placental 


| Neural-cadherin precursor (N-cadherin) (Cadherin-2) j 


cadherin 2, type 1 preproprotein; N-cadherin 1; cadherin 2, N-cadherin (neuronal); 
neural cadherin; calcium-dependent adhesion protein, neuronal 


| N-cadherin 


jcadherin 2 precursor - human 


jcadherin 2, type 1 , N-cadherin (neuronal) j 


cadherin 4, type 1 preproprotein; cadherin 4, R-cadherin (retinal); R-cadherin; 
retinal cadherin 


| Cadherin-4 precursor (Retinal-cadherin) (R-cadherin) (R-CAD) | 


| N-cadherin j 


| N-cadherin j 


|unnamed protein product i 


chaperone-ABC1-like 


| unnamed protein product • j 


|chaperone, ABC1 activity of bd complex like j 


|Similar to RIKEN cDNA 061 001 2P1 8 gene I 


(hypothetical protein FLJ 12229 ] 


| Unknown (protein for MGC:36739) j 


[unknown j 
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| IJHUCN j 


j AAH3 6470.1 | 
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|CAA40773.1 | 
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tvisiae) homolog; 






acetyl-CoA synthetase isoform a; cytoplasmic acetyl-coenzyme A synthetase; 
acetate-CoA ligase; acyl-activating enzyme; acetate thiokinase; acetyl-CoA 
synthetase 






j A synthetase; 
; acetyl-CoA 




carboxylesterase - human (fragment). 


serine esterase N-terminal truncated (503 AA) 


carboxylesterase 


brain carboxylesterase hBr1 B 


cholinephosphotransferase 1 






choline phosphotransferase 1; cholinephosphotransferase 1; 
cholinephosphotransferase 1 alpha 


choline/ethanolaminephosphotransferase 


IMSTP022 


|AAPT1-like protein 


(cholinephosphotransferase 1 beta 


IPRO1101 


Golgi vesicular membrane trafficking protein p18; Bet1 (S. cere 
Betlp homolog 






ISimilar to acetyl-CoA synthetase 


I unnamed protein product 


acetyl-CoA synthetase isoform b; cytoplasmic acetyl-coenzyme 
acetate-CoA ligase; acyl-activating enzyme; acetate thiokinase 
synthetase 


| Unknown (protein for MGC: 19474) 


157004 1 


CAA37 147.1 


AAA83932.1 


BAA84995.1 | 


AAH20819.1 
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|NP 006081.1 1 
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AAD44019.1 
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| unnamed protein product j 


|dJ18C9.1.1 (similar to acetyl-coenzyme A synthetase, isoform 1) ] 


|dJ1 1 61 H23.1 (similar to acetyl-coenzyme A synthetase) I 


|dJ18C9.1 .2 (similar to acetyl-coenzyme A synthetase, isoform 2) J 


dJ568C1 1 .3 (novel AMP-binding enzyme similar to acetyl-coenzyme A synthethase 
(acetate-coA ligase)) 


similar to dJ568C1 1 .3 (novel AMP-binding enzyme similar to acetyl-coenzyme A 
synthethase (acetate-coA ligase)) 


| unnamed protein product j 


| KIAA1 846 protein ! 


| hypothetical protein FL J2 1 963 ! 


|dJ18C9.1.3 (similar to acetyl-coenzyme A synthetase, isoform 3) | 


L-kynurenine/alpha-aminoadipate aminotransferase; kynurenine aminotransferase 
II 


| Similar to L-kynurenine/alpha-aminoadipate aminotransferase j 


alpha-synuclein isoform NACP140; non A4 component of amyloid precursor 


| NACP/alpha-synuclein j 


phospholipid transfer protein 


(phospholipid transfer protein \ 


|dJ337018.1.2 (Phospholipid Transfer Protein (Lipid Transfer Protein II) (isoform 2)) 


| Similar to phospholipid transfer protein I 


deoxyribonuclease II, lysosomal; DNase II, lysosomal 


i hypothetical protein R31 240 2 [imported] i 


|BAB14127.1 1 


|CAB61786.2 | 


ICAB93422.4 | 


|CAC33037.2 


CAB75500.1 


XP_042770.2 


|BAC03853.1 | 


|BAB47475.1 | 


|NP_078836.1 | 


|CAC33039.2 | 


NP_057312.1 


|AAH31068.1 


NP_000336.1 


|AAC02 114.1 1 
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|NP_006218.1 | 
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|AAH05045.1 | 


NP_001366.1 
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ADAM 7 precursor (A disintegrin and metalloproteinase domain 7) (Sperm 
matuiation-related glycoprotein GP-83). 


a disintegrin and metalloproteinase domain 8 precursor _ 1 
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a disintegrin and metalloproteinase domain 9 preproprotein; nieltrin gamma J 
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fibroblast growth factor receptor 1 isoform 3 precursor; fins-related tyrosine kinase-2; 
heparin-binding growth factor receptor; FMS-like tyrosine kinase 2; basic fibroblast growth 
factor receptor 1; N-sam tyrosine kinase; FLG protein; protein-tyrosine kinase; 
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tyrosylprotein kinase; hydroxyaryl-protein kinase 
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fibroblast growth factor receptor 2 isoform 1 precursor; keratinocyte growth factor receptor; 
K-sam protem; protein tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed 
kinase; fibroblast growth factor receptor BEK; tyrosylprotein kinase; hydroxyaryl-protein 
kinase 


| heparin-binding growth factor receptor variant alpha-a2 


| keratinocyte growth factor receptor 2 isoform BEK j 


fibroblast growth factor receptor 1 isoform 7 precursor; fins-related tyrosine kinase-2; 
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factor receptor 1; N-sam tyrosine kinase; FLG protein; protein-tyrosine kinase; 
tyrosylprotein kinase; hydroxyaryl-protein kinase 
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fibroblast growth factor receptor 2 isoform 13 precursor; keratinocyte growth factor 
receptor; K-sam protein; protein tyrosine kinase, receptor like 14; FGF receptor; 
bacteria-expressed kinase; fibroblast growth factor receptor BEK; tyrosylprotein kinase; 
hydroxyaryl-protein kinase 
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fibroblast growth factor receptor 4 isoform 2 precursor; tyrosine kinase related to fibroblast 
growth factor receptor; tyrosylprotein kinase; protein-tyrosine kinase; hydroxyaryl-protein 
kinase 


fibroblast growth factor receptor 4, soluble-form splice variant 1 
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fibroblast growth factor receptor 4 variant 1 


fibroblast growtii factor receptor 2 isoform 3 precursor; keratinocyte growth factor receptor; 
K-sam protein; protein tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed 
kinase; fibroblast growth factor receptor BEK; tyrosylprotein kinase; hydroxyaryl-protein 
kinase 


Chain A, Crystal Structure Of The Tyrosine Kinase Domain Of Fibroblast Growth Factor 
Receptor 1 


fibroblast growth factor receptor 3 isoform 2 precursor; hydroxyaryl-protein kinase; 
tyrosine kinase JTK4 


| Chain A, The Fgfi2 Tyrosine Kinase Domain 


fibroblast growth factor receptor subtype 1, FGFRl {extracellular binding domain 
D(ri)-D(m)} [human, A159 endometrial carcinoma cells, Peptide Partial, 279 aa]. 


Cham C, Crystal Structure Of Fgfl In Complex With The Extracellular Ligand Binding 
Domain Of Fgf Receptor 1 (Fgfrl). 


| Chain C, Crystal Structure Of A Dimeric Fgf2-Fgfrl Complex 


fibroblast growth factor receptor 2 isoform 9 precursor; keratinocyte growth factor receptor; 
K-sam protein; protein tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed 
kinase; fibroblast growth factor receptor BEK; tyrosylprotein kinase; hydroxyaryl-protein 
kinase 


fibroblast growth factor receptor 1 isoform 5 precursor; fms-related tyrosine kinase-2; 
heparin-binding growth factor receptor; FMS-like tyrosine kinase 2; basic fibroblast growth 
factor receptor 1; N-sam tyrosine kinase; FLG protein; protein-tyrosine kinase; 
tyrosylprotein kinase; hydroxyaryl-protein kinase 


Chain E, Crystal Structure Of Fgf2 In Complex With The Extracellular Ligand Binding 
Domain Of Fgf Receptor 2 (Fgfr2). 
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| Chain B, Crystal Structure Of A Ternary Fgfl-Fgfr2-Heparin Complex. 


fibroblast growth factor receptor 1 isoform 6 precursorjfms-related tyrosine kinase-2; 
heparin-binding growth factor receptor; FMS-like tyrosine kinase 2; basic fibroblast growth 
factor receptor 1; N-sam tyrosine kinase; FLG protein; protein-tyrosine kinase; 
tyrosylprotein kinase; hydroxyaryl-protein kinase 


Chain E, Crystal Structure Of Ser252trp Apert Mutant Fgf Receptor 2 (Fgfr2) In Complex 


Chain E, Crystal Structure Of Pro253arg Apert Mutant Fgf Receptor 2 (Fgfr2) In Complex 
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Chain A, Ligand-Binding Portion Of Fibroblast Growth Factor Receptor 2 In Complex 
WithFgfl 
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K-sam protein; protein tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed 
kinase; fibroblast growth factor receptor BEK; tyrosylprotein kinase; hydroxyaryl-protein 
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|insulin-like growth factor I receptor, IGF-IR [human, Peptide, 1367 aa]. 


| insulin-like growth factor 1 receptor precursor 
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Chain A, Structure Of Stem Cell Factor 
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Chain A, Human Procarboxypeptidase B: Three-Dimensional Structure And Implications 
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N-methyltransferase) 


(indolethylamine N-methyltransferase; thioester S-methyltransferase-like | 
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ornithine aminotransferase precursor; Ornithine aminotransferase 

ornithine aminotransferase, OAT [human, gyrate atrophy of the choroid and retina 
(GACR) patient, Peptide Mutant, 439 aa] 

Chain A, Human Ornithine Aminotransferase Complexed With The Neurotoxin 
Gabaculine 

ar to Ornithine aminotransferase, mitochondrial precursor (Ornithine-oxo-acid 
aminotransferase) 

proline dehydrogenase (oxidase) 1 ; proline oxidase 2; p53 induced protein 

proline oxidase 2 

proline dehydrogenase; PRODH 

proline dehydrogenase; proline oxidase 2 
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kidney and liver proline oxidase 1 

cytochrome P450, subfamily IIA (phenobarbital-inducible), polypeptide 13 
Cytochrome P450 2A1 3 (CYPIIA1 3) 
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cytochrome P450 2A4 - human 
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(Brain carboxylesterase hBr1) 
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(Delta14-SR) (Transmembrane 7 superfamily member 2) (Another new gene 1) 
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| keratin 13, type I, cytoskeletal, long splice form 


| keratin type 16 


| keratin 16; keratin, type I cytoskeletal 16: cytokeratin 16 
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| type I hair keratin 5 ; Ha-5 ; hard keratin, type I, 5 


keratin 24 


Keratin, type I cuticular HA5 (Hair keratin, type 1 HA5). 


| HHa5 hair keratin type I intermediate filament j 
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type I hair keratin 4; hard keratin, type I, 4 

type I hair keratin 2; Ha-2; hard keratin, type I, 2; keratin, hair, acidic,2 
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Keratin, type I cuticular HA2 (Hair keratin, type I HA2). 

hair type I acidic keratin 

keratin 18 

cytokeratin 9 

cytokeratin 18 (424 AA) 

keratin 9 

keratin Ha3-II, type I, hair 

Keratin 18 

keratin- 10 

type I hair keratin 7 

keratin 10; Keratin- 10 

Keratin, type I cuticular HA8 (Hair keratin, type I HA8). 
type I hair keratin 8 
Similar to keratin 1 8 
type I hair keratin 7 
unnamed protein product 
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netrin 4 precursor 


Usher syndrome type Ila protein 


netrin 4; beta-netrin 


KIAA1907 protein 


Similar to Laminin chain beta 2 


laminin, gamma 1 precursor; formerly LAMB 2 


|Laminin gamma-1 chain precursor (Laminin B2 chain) . 


similar to laminin alpha 3b chain 


laminin, gamma 3 precursor 


|Laminin gamma-3 chain precursor (Laminin 12 gamma 3) . | 


laminin alpha 3b chain 


unnamed protein product 


netrin 1; netrin 1, mouse, homolog of 


laminin A chain 


Laminin alpha- 1 chain precursor (Laminin A chain) . 
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polymerase 


polymerase 1 


guanine nucleotide binding protein (G protein), alpha 14; guanine nucleotide-binding 
protein 14 


GTP-binding protein alpha q subunit j 


GTP-binding protein alpha q | 


Guanine nucleotide-binding protein G(Y), alpha subunit (Alpha-1 1 ) \ 


Guanine nucleotide-binding protein G(q), alpha subunit | 


GTP-binding protein alpha-q - human (fragment) I 


guanine nucleotide binding protein (G protein), alpha 11 (Gq class); Guanine 
nucleotide-binding protein, Gq class, GNA1 1 


(guanine nucleotide binding protein alpha 1 1 subunit I 


guanine nucleotide binding protein (G protein), q polypeptide; Guanine 
nucleotide-binding protein (G protein), q; similar to guanine nucleotide binding protein 
(G protein), q polypeptide (H. sapiens) 


| guanine nucleotide binding protein (G protein), alpha 15 (Gq class) J 


| similar to GNA15; ALPHA-1 6 


| similar to Guanine nucleotide-binding protein, alpha-1 1 subunit (GL2) 


| Guanine nucleotide-binding protein G(O), alpha subunit 2 


| guanine nucleotide binding protein alpha oB 


Guanine nucleotide-binding protein G(i), alpha-1 subunit (Adenylate cyclase-inhibiting 
G alpha protein) 


(hypothetical protein 


jguanine nucleotide-binding regulatory protein alpha-inhibitory subunit 


guanine nucleotide binding protein (G protein), alpha inhibiting activity polypeptide 2; 
Guanine nucleotide-binding protein (G protein), alpha- 


jguanine nucleotide binding protein (G protein), alpha activating activity polypeptide O 


Similar to guanine nucleotide binding protein (G protein), alpha inhibiting activity 
polypeptide 2 


| guanine nucleotide-binding protein alpha-i subunit 
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.guanine nucleotide binding protein (G protein), alpha inhibiting activity polypeptide 1 | 


guanine nucleotide binding protein (G protein), alpha inhibiting activity polypeptide 3; 
87U6 


guanine nucleotide-binding protein j 


similar to Guanine nucleotide-binding protein G(T), alpha-3 subunit (Gustducin alpha-3 
chain) | 


solute carrier family 27 (fatty acid transporter), member 4; fatty acid transport protein 4 


| Unknown (protein for IMAGE:3613739) | 


| Unknown (protein for MGC:16752) I 


| unnamed protein product | 


| solute carrier family 27 member 3; fatty acid transport protein 3 | 


| very long-chain acyl-CoA synthetase homolog 1 j 


solute carrier family 27 (fatty acid transporter), member 5; very long-chain acyl-CoA 
synthetase homolog 2; very long-chain acyl-CoA synthetase-related protein; likely 
ortholog of mouse solute carrier family 27 (fatty acid transporter), member 5 


solute carrier family 27 (fatty acid transporter), member 2; very long-chain 
fatty-acid-coenzyme A ligase 1 ; very-long-chain acyl-CoA synthetase 


J Similar to hypothetical protein MGC4365 


heat shock 70kDa protein 1 A; heat shock 70kD protein 1 A; heat shock-induced 
protein; dnaK-type molecular chaperone HSP70-1 


| Heat shock 70 kDa protein 1 (HSP70.1) (HSP70-1/HSP70-2) 


| heat shock 70kDa protein. 1B; heat shock 70kD protein 1B 


| dnaK-type molecular chaperone HSPA1L 


J heat shock 70kD protein 1 -like 


| heat shock 70kDa protein 1 -like; Heat-shock 70kD protein-like-1 


| Heat shock protein 70 testis variant . | 


| similar to heat shock protein j 


NP_002060.3 | 


NP_006487.1 


IAAA52584.1 1 


XP_170405.1 


NP_O05O85.1 


|AAH04268.1 I 


|AAH09959.1 I 


|BAB55156.1 j 


|NP_077306.1 | 


|NP_054750.1 j 


NP_036386.1 


o 
o 

i 


|AAH03654.1 j 


NP_005336.2 


|P08107 


|NP_005337.1 


|A29160 


|XP_175177.1 


|NP_005518.1 j 


|BAA32521.1 ! 


|XP 166348.1 










U:(C-D)+ 
2.94, 
U:(HI-D) 
2.87 


















+ 
S 
o 

±3 


























Mm.30487 












































AK009292 
BAB26196.1 


















Ml 2573 


AAA37863.1 | 

















lo o in o 

r-i tH CM 



WO 2004/092416 



PCT/US20O4/O10191 



in 



I IM 

I 8 



WO 2004/092416 



PCT/US2004/010191 



1.0e-30 














S o 


1.0e-72i 


1 .Oe-72 


1. Oe-72 


2 

o 


8.0e-62 


o 


o 
cK 


c 
oi 














jo 


VO 


VO 






N <N 










vo 


Ov 




VD 










Major epididyniis-specific protein E4 precursor (HE4) (Epididymal secretory protein E4) 
(WAP four- disulfide core domain protein 2 




brain-derived neurotrophic factor receptor precursor, short splice form 


| neurotrophic tyrosine kinase, receptor, type 2 ; 


| neurotrophin receptor tyrosine kinase type 2 truncated isoform 


| neurotrophin receptor tyrosine kinase type 2 truncated isoform 


BDNF/NT-3 growth factors receptor precursor (TrkB tyrosine kinase) (GP145-TrkB) (Trk-B). 


gene trkC protein 


(neurotrophic tyrosine kinase, receptor, type 3 j 


|TRKC ] 


|NT-3 growth factor receptor precursor (TrkC tyrosine kinase) (GP 145-TrkC) (Trk-C | 


[neurotrophic tyrosine kinase, receptor, type 1; Oncogene TRK | 


High affinity nerve growth factor receptor precursor (TRKl transforming tyrosine kinase 
protein) (pl40-TrkA) (Trk-A). 


[nerve giowth factor receptor precursor, high affinity | 


| Chain X, Ligand Binding Domain Of Human Trkb Receptor j 


Chain X, Crystal Structure Of Trkb-D5 Bound To Neurofrophin-45 1 




glucose-6-phospbatase, transport (glucose-6-phosphate) protein 1; Glucose-6-phosphate 
transporter- 1 


glucose-6-phosphate transporter ' J 


glucose 6-phosphate translocase j 
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metallocarboxypeptidase CPX-1 precursor 


Potential carboxypeptidase X precursor (Metallocarboxypeptidase 
CPX-1) . 


unnamed protein product 


similar to Potential carboxypeptidase-like protein X2 precursor 


|Potential carboxypeptidase-like protein X2 precursor j 


Unknown (protein for IMAGE:5752707) 


unnamed protein product 


Similar to AE binding protein 1 


adipocyte enhancer binding protein 1 precursor; AE-binding 
protein 1; aortic carboxypeptidase-like protein 


jadipocyte transcription factor, AEBPl j 


Similar to carboxypeptidase X (M14 family) 


Similar to carboxypeptidase Z 


carboxypeptidase E precursor 


CPZ gene product 


| carboxypeptidase E - human ] 
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carboxypeptidase E 


carboxypeptidase N, polypeptide 1, 50kD precursor 


similar to carboxypeptidase D 


carboxypeptidase D 


carboxypeptidase D precursor 


jCarboxypeptidase D precursor (gpl80) . ] 


| carboxypeptidase M (EC 3 . 4 . 17 . 12) precursor - human j 


carboxypeptidase M precursor 
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j brain myo-inositol monophosphatase A2b; IMPase A2b 


Chain A, Human Inositol Monophosphatase (E.C.3. 1.3.25) Dimer Complex With Gadolinium 
And Sulfate 


|inositol(myo)-l(or 4)-monophosphatase 1 
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hemoglobin alpha-2 i 


[chain A, Deoxy Rhbl.2 (Recombinant Hemoglobin) . ] 


jchain A, Deoxy Rhbl.l (Recombinant Hemoglobin) . { 


Chain A, Alpha -Ferrous -Carbonmonoxy, Beta-Cobaltous -Deoxy ! 
Hemoglobin (T State) . 


Chain A, Hemoglobin Thionville Alpha Chain Mutant With Val 1 
Replaced By Glu And An Acetylated Met Bound To The Amino 
Terminus 


[chain A, Hemoglobin (Alpha Vim) Mutant. j 


[chain A, R-State Human Carbonmonoxyhemoglobin Alpha-A53s j 


jchain A, Artificial Mutant (Alpha Y42h) Of Deoxy Hemoglobin. j 


Chain A, Hemoglobin A (Deoxy) Mutant With Arg A 141 And Arg C 
141 Deleted (Del R (A 141), R(C 141)) (Desarg (141alpha) ) . 


|Chain A, Deoxy Hemoglobin (A-Glyglygly-C : vim, L29w; B,D:vlm) . | 


jchain A, Deoxy Hemoglobin (A, C:vlm, V621; B, D :vlm, V671) . j 


jchain A, R State Human Hemoglobin [alpha V96w] , Carbonmonoxy. j 


Chain A, Deoxyhemoglobin T38w (Alpha Chains) , Vlg (Alpha And 
Beta Chains) . 


Chain A, Deoxy Hemoglobin (A-Gly-C : vim, L29f ,H58q; 
B,D:vlm,L106w) . 


[Chain A, Deoxy Hemoglobin (A-Glyglygly-C : vim, L29f,H58g 


jchain A, Deoxy Hemoglobin (A-Gly-C:vlm,L29w,H58q; B,D:vlm) . | 


jchain A, Cyanomet Hemoglobin (A-Gly-C :vlm, L29f ,H58q 


Chain A, Deoxygenated Structure Of A Distal Pocket Hemoglobin 
Mutant . 
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multidrug resistance-associated protein (MRP) -like protein-2 
(MLP-2) 


canalicular multispecif ic organic anion transporter j 


truncated ABCC12 protein 


multidrug resistance associated protein 


FLJ00002 protein 


ATP-binding cassette, sub-family C, member 10; multidrug 
resistance-associated protein 7 


sulfonylurea receptor 1 


(Sulfonylurea receptor 1 


sulfonylurea receptor 


alternative splice (exon 17) 


multidrug resistance-associated protein 7 


ATP-binding cassette, sub-family C, member 8; Sulfonylurea 
receptor; sulfonylurea receptor (hyperinsulinemia) 


ATP-binding cassette, sub-family C, member 9 isoform 
SUR2A-delta-14; sulfonylurea receptor 2A 


multidrug resistance protein 


ATP-binding cassette, sub-family C, member 1 isoform 5; 
multiple drug resistance-associated protein; multiple drug 
resistance protein 1; multidrug resistance protein 
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Unknown (protein for IMAGE : 335584 8) 


ATP-binding cassette, sub-family C, member 3 isoform MRP 3 A; 
canicular multispecif ic organic anion transporter 


sulfonylurea receptor 


cystic fibrosis transmembrane conductance regulator, 
ATP-binding cassette (sub-family C, member 7) ; cystic fibrosis 
transmembrane 

conductance regulator; ATP-binding cassette, sub-family C 
member 7; CFTR/MRP 


Cystic fibrosis transmembrane conductance regulator (CFTR) 
(cAMP-dependent chloride channel) . 


MRP3sl protein 


Similar to ATP-binding cassette, sub- family C (CFTR/MRP) , 
member 4 


anthracycline resistance associated protein 


Similar to hypothetical protein FLJ00002 


cystic fibrosis transmembrane conductance regulator 


ATP-binding cassette, subfamily B, member 4 isoform C; P 
glycoprotein 3/multiple drug resistance 3 ; 
P-glycoprotein-3/multiple drug resistance-3; multiple drug 
resistance 3 
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ATP-binding cassette, subfamily B, member 4 isoform A; P 
glycoprotein 3/multiple drug resistance 3; 
P-glycoprotein-3/multiple drug resistance-3; multiple drug 
resistance 3 


ATP-binding cassette, sub-family B (MDR/TAP) , member 1; P 
glycoprotein l/multiple drug resistance 1; 
P-glycoprotein-l/multiple drug resistance-1; multidrug 
resistance 1 


P-glycoprotein 


Multidrug resistance protein 1 (P-glycoprotein 1) (CD243 
antigen) . 


coded for by human cDNA M96936 (NID :gl80293) 


ATP-binding cassette, subfamily B, member 4 isoform B; P 
glycoprotein 3/multiple drug resistance 3; 
P-glycoprotein-3/multiple drug resistance-3; multiple drug 
resistance 3 


Similar to ATP-binding cassette, sub-family C (CFTR/MRP) , 
member 11 


|hypothetical protein DKFZp434I2115 . 1 


transmembrane chloride conductor protein 




Similar to KIAA1001 protein 


|KIAA1001 protein j 


(arylsulfatase A j 


arylsulfatase. A precursor | 


Similar to arylsulfatase A j 
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Human Arylsulfatase A 


Arylsulfatase A; Chain: P; Synonym: Cerebroside-3-Sulfate-Sulfatase; Ec: 3.1 .6.8 1 


N-acetylgalactosamine-6-sulfatase precursor; Galactosamine (N-acetyl)-6-sulfate 
sulfatase; chondroitinase 


Unknown (protein for MGC:24090) 1 


arylsulfatase F I 


similar to arylsulfatase F 1 


Unknown (protein for MGC:31932) 


arylsulfatase D precursor, isoform a 


Tumor-associated calcium signal transducer 1 precursor (Major gastrointestinal 
rumor-associated protein GA733-2) (Epithelial cell surface antigen) (Epithelial glycoprotein) 
(EGP) (Adenocarcinoma-associated antigen) (KSA) (KS 1/4 antigen) (Cell surface 
glycoprotein Trop-1). 


tumor-associated calcium signal transducer 1 precursor; membrane component, chromosome 4, 
surface marker (35kD glycoprotein); MK-1 antigen; antigen identified by monoclonal antibody 
AUAl 


| epithelial glycoprotein (EGP) precursor 


| carcinoma-associated antigen GA733-1 precursor 


tumor-associated calcium signal transducer 2 precursor; membrane component, chromosome 1, 
surface marker 1 (40kD glycoprotein, identified by monoclonal antibody GA733); epithelial 
glycoprotein- 1 


Tumor-associated calcium signal transducer 2 precursor (Pancreatic carcinoma marker protein 
GA733-1) (Cell surface glycoprotein Trop-2). 
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Synonym: Cerebroside-3-Sulfate-Sulfatase; Ec: 
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Arylsulfatase A; Chain: P; 


Arylsulfatase A; Chain: P; 
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cytochrome P450, subfamily IID, polypeptide 6; debrisoquine 4-hydroxylase; 
microsomal monooxygenase; xenobiotic monooxygenase; flavoprotein-linked 
monooxygenase 


debrisoquine 4-hydroxylase j 


cytochrome P450db1 | 


Synapsin III 


;sofonn Ilia j 


isoform IIIc | 


isoform Illb J 


| synapsin lib | 


| synapsin II isoform Ha j 


| synapsin II isoform lib | 


| Synapsin II j 


| synapsin lib \ 


| synapsin I isofonn lb; brain protein 4. 1 j 


| synapsin I isoform la; brain protein 4.1 | 


| synapsin I spliee form a j 


|SynapsinI (Brain protein 4.1). ; 


j synapsin lb 




ketohexokinase isoform b 


| ketohexokinase isoform a 


j ketohexokinase (fhictokinase) 


| ketohexokinase 


| ketohexokinase 


| ketohexokinase ' j 


| synapsin IE i 


| synapsin mi 


(synapsin III i 
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Ampophilin (Adipose differentiation-related protein) (ADRP). 


| adipose differentiation-related protein; adipophilin | 


| cargo selection protein (mannose 6 phosphate receptor binding protein; placental protein 1 7 j 


Cargo selection protein TIP47 (47 kDa mannose 6-phosphate receptor-binding protein) (47 
kDa MPR-binding protein) (Placental protein 17). 


| placental protein 1 7b 1 ; PP 1 7b 1 i 




protein tyrosine phosphatase type IVA, member 1; Protein tyrosine phosphatase IVAI 


protein tyrosine phosphatase type IVA, member 2 isoform 1; protein tyrosme phosphatase 
rVA2; protein tyrosine phosphatase IV A; phosphatase of regenerating liver 2 


Iptp-IVlb, PTP-IVl gene product j 


protein tyrosine phosphatase type IVA, member 3 isoform 1; potentially prenylated protein 
tyrosine phosphatase 


|protein tyrosine phosphatase - human (fragment). j 


|hPRL-3 j 


protein tyrosine phosphatase type IVA, member 3 isoform 2; potentially prenylated protein 
tyrosine phosphatase 
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adenylate cyclase 6 isoform b 
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Similar to adenylate cyclase 7 


unnamed protein product 


KIAA0520 protein 


adenylate cyclase 9 


Adenylate cyclase, type IX (ATP p] 
cyclase) . 


adenylyl cyclase type IX 




sulfotransferase family, cytosolic 
sulfotransferase ICl 


sulfotransferase ICl 


sulfotransferase family, cytosolic 
sulfotransferase family, cytosolic 
sulfotransferase 1C2 


|Sulfotransferase 1C2 (SULTIC) (SUI 


thyroid hormone sulfotransferase 


Unknown (protein for MGC: 133.56) 


'thyroid hormone sulfotransferase I 


Phenol -sulfating phenol sulfotrans 
(Thermostable phenol sulfotransf ei 
(ST1A3 ) . 
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sulfotransferase , estrogen-preferring 


Chain A, Crystal Structure Of Human Estrogen Sulfotransferase 
V269e Mutant In The Presence Of Paps 


Chain A, Crystal Structure Of Human Dehydroepiandrosterone 
Sulfotransferase In Complex With Substrate 


dehydroepiandrosterone sulfotransferase 


alcohol/hydroxysteroid sulfotransferase; hSTa 


Alcohol sulfotransferase (Hydroxysteroid Sulfotransferase) 
(HST) (Dehydroepiandrosterone sulfotransferase) (DHEA-ST) (ST2) 
(ST2A3) . 


hydroxysteroid sulfotransferase SULT2Bla 


hydroxysteroid sulfotransferase SULT2Bla 


Chain A, Crystal Structure Of The Human Hydroxysteroid 
Sulfotransferase In The Presence Of Pap. 


hydroxysteroid sulfotransferase SULT2Blb 




pleckstrin 2; pleckstrin 2 (mouse) homolog 


| pleckstrin 2 (mouse) homolog j 


| pleckstrin; p47 ] 


| Unkno wn (protein for MGC: 17111) 


| protein kinase C substrate protein P47 j 
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synaptotagmin-like 4 (granuphilin-a) 


unnamed protein product 


bA524D16A.2 .1 (novel protein similar to mouse granuphilin-a) 


bA524D16A.2 .2 (novel protein similar to mouse granuphilin-b) 


synaptotagmin-like 5 


unnamed protein product 


synaptotagmin-like 2 isoform b; chromosome 11 synaptotagmin 


KIAA1597 protein 


synaptotagmin-like 2 isoform a; chromosome 11 synaptotagmin 


Unknown (protein for IMAGE : 3942111) 


hypothetical protein SB146 


NADPH oxidase-related, C2 domain-containing protein 


Similar to NADPH oxidase-related, C2 domain- containing protein 


Similar to NADPH oxidase-related, C2 domain-containing protein 


NP_542775 
.1 


BAC04287. 
1 


1 •„ 


u 

U rH 


in 

cm' h 


BAB15030. 
1 


NP_115755 
.2 


BAB13423 . 
1 


NP_116561 
.1 


AAH09224 . 
1 


AAK67636. 
1 


NP_116261 
.1 


AAH15764. 
1 


AAH35725. 
1 


U:(C-D)+ 
2.22 




























Mm.ioD/4 




























NM_0 13757 
NP 038785.1 





























o 

rH 



(D 

o 


0 
0 

1.0e-154 


i.ue-izu 
1.0e-119 
1.0e-117 
1.0e-81 


3.0e-65 
7.0e-54 
8.0e-50 


1.0e-106 
1.0e-106 
1.0e-106 


1 -I 


I 
















similar to synaptotagmin-like protein 3-a 


transformkig giowth factor, beta 3 
fraosformmg growth factor-beta 3 
Similar to transforming growth factor, beta 3 


transiorrrung growtu tactor, beta l 

transfonning growth factor beta 2 

transfonning growth factor beta-2 precursor, long form 

Transfonning growth factor beta 1 precursor (TGF-beta 1). 

transfonning growth factor, beta 1 (Camurati-Engeknann disease); transforrning growth factor, 


Human Transfoirning Growth Factor Beta 3, Crystallized From Peg 4000 

Transforming Growth Factor Type Beta 2 (Tgf-B2). ; 

Chain A, Solution Structure Of Tgf-Bl, Nmr, Models 18-33 Of 33 Structuies 


general transcription factor IIF, polypeptide 1 (74kD subunit) 

general transcription factor IIF, polypeptide 1 (74kD subunit) 
RAP74 


Transcription Initiation Factor lif, Subunit; Chain: A, C, E, G; Fragment: Residues 
2-119; Synonym: Transcription Initiation Factor Rap30 

pleckstrin homology-like domain, family A, member 3; pleckstrin homology-like domain, 
family A, member 2 
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similar to keratin 17 ! 


|Keratin, type I cytoskeletal 10 (Cytokeratin 10) (KIO) (CK 10) . | 


keratin 10 (epidermolytic hyperkeratosis; keratosis palmaris et 
plantaris) 
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keratin 10; Keratin-10 


Similar to keratin, hair, acidic, 6 


type I hair keratin 6 


keratin-10 


type I hair keratin 5; Ha-5; hard keratin, type I, 5 


type I hair keratin 5 


|Keratin, type I cuticular HA5 (Hair keratin, type I HA5) . 


HHa5 hair keratin type I intermediate filament 


unnamed protein product 


keratin 24 


similar to keratin complex-1, acidic, gene C29; keratin 
complex-1, gene C2 9 
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type I hair keratin 1; Ha-1; hard keratin, type I, 1; keratin, 
hair, acidic,! 


|Keratin, type I cuticular HAl (Hair keratin, type I HAl) . ] 


type I hair keratin 3B; Ha-3II; hard keratin, type I, 311; 
keratin, hair, acidic, 3B 


type I hair keratin 2; Ha-2; hard keratin, type I, 2; keratin, 
hair, acidic, 2 


type I hair keratin 3A; Ha-3I; hard keratin, type 1,31; 
keratin, hair, acidic, 3A 


hair keratin acidic 3 -II 


keratin 20; cytokeratin 20;' keratin, type I cytoskeletal 20 


|Keratin, type I cuticular HA3-I (Hair keratin, type I HA3-I). J 


jkeratin 20, type I-like, cytoskeletal - human j 


[keratin Hal, type I, hair j 


similar to keratin, hair, acidic, 4 


jKeratin, type I cuticular HA2 (Hair keratin, type I HA2) . | 


hair type I acidic keratin 


type I hair keratin 4; hard keratin, type I, 4 


jKeratin, type I cuticular HA4 (Hair keratin, type I HA4) . j 


jkeratin Ha3-II, type I, hair 1 
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unnamed protein product 


Keratin, type I cytoskeletal 23 (Cytokeratin 23) uuj) lua. zi) . 


similar to keratin 17 


similar to Keratin, type I cytoskeletal 18 (Cytokeratin its) 
(K18) (CK 18) 


keratin 


mutant keratin 9 


similar to Keratin, type I cytoskeletal 18 (Cytokeratin 18) 
(K18) (CK IB) 


Unknown (protein for IMAGE: 5444378) 




galectin 4; lectin galactoside-binding soluble 4 
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cytochrome P450, family 2, subfamily C, polypeptide 18; 
cytochrome P.450, subfamily IIC (mephenytoin 4 -hydroxylase) , 
polypeptide 17; cytochrome P450, subfamily IIC (mephenytoin 
4 -hydroxylase) , polypeptide 18; microsomal monooxygenase; 
f 1 avoprot e in - 1 inked monooxygenas e 


ley tochrome P450 2C18 (CYPIIC18) (P450 -6B/29C) . 
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cytochrome P450, family 2, subfamily C, polypeptide 19; 
cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), 
polypeptide 19; mephenytoin 4 1 -hydroxylase; microsomal 
monooxygenase; xenobiotic monooxygenase; f lavoprotein-linked 
monooxygenase 


cytochrome P-450 


cytochrome P-450 


cytochrome P45 0, family 2, subfamily C, polypeptide 9; 
cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), 
polypeptide 10; mephenytoin 4-hydroxylase; microsomal 
monooxygenase; xenobiotic monooxygenase; f lavoprotein-linked 
monooxygenase; cytochrome P450, subfamily IIC (mephenytoin 
4-hydroxylase) , polypeptide 9 


Cytochrome P450 2C8 (CYPIIC8) (P450 form 1) (P450 MP-12/MP-20) 
(P450 IIC2) (S-mephenytoin 4-hydroxylase) . 


S-mephenytoin 4 1 -hydroxylase (EC 1.14.14.-) cytochrome P450 
2C19 


Unknown (protein for MGC: 22146) 


cytochrome P-450 S-mephenytoin 4-hydroxylase 


| cytochrome P450 ; 


Cytochrome P450 2C10 (CYPIICIO) (P450 MP-8) (S-mephenytoin 
4-hydroxylase) (P-450MP) . 
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cytochrome P450, family 2, subfamily C, polypeptide 8 isoform 
1; cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), 
polypeptide 8; mephenytoin 4-hydroxylase; microsomal 
monooxygenase; xenobiotic monooxygenase; f lavoprotein-linked 
monooxygenase; P450 form 1 


cytochrome P450 arachidonic acid epoxygenase isoform, Cyp 2C8 
[human, kidney, Peptide Partial, 4 85 aa] . 


cytochrome P-450 S-mephenytoin 4-hydroxylase 


cytochrome P-450 S-mephenytoin 4-hydroxylase. 


[cytochrome P450 - human 


cytochrome P-45 0 S-mephenytoin 4-hydroxylase 


[cytochrome P450 2C17 - human 


cytochrome P450, family 2, subfamily C, polypeptide 8 isoform 
2; cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), 
polypeptide 8; mephenytoin 4-hydroxylase; microsomal 
monooxygenase; xenobiotic monooxygenase; f lavoprotein-linked 
monooxygenase; P450 form 1 


cytochrome P450, family 2, subfamily E, polypeptide 1; 
cytochrome P450, subfamily HE (ethanol-inducible), polypeptide 
1; microsomal monooxygenase; xenobiotic monooxygenase; 
f lavoprotein-linked monooxygenase; cytochrome P450, subfamily 
HE (ethanol-inducible) 
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cytochrome P450, family 2, subfamily F, polypeptide 1; 1 
cytochrome P450, subfamily IIF, polypeptide 1; microsomal 
monooxygenase; xenobiotic monooxygenase; f lavoprotein-linked 
monooxygenase 


cytochrome P450 2F1 


cytochrome P450, family 2, subfamily B, polypeptide 6; 
cytochrome P450, subfamily IIB (phenobarbital-inducible), 
polypeptide 6 


{Cytochrome P450 2F1 (CYPIIFl) . j 


cytochrome P450-2B6 


cytochrome P450, family 2, subfamily A, polypeptide 13; 
cytochrome P450, subfamily IIA (phenobarbital-inducible), 
polypeptide 13 j 


[Cytochrome P45 0 2A13 (CYPIIA13) . j 


cytochrome P450, family 2, subfamily A, polypeptide 6; coumarin 
7 -hydroxylase; cytochrome P450, subfamily IIA 
(phenobarbital-inducible), polypeptide 3; cytochrome P450, 
subfamily IIA (phenobarbital-inducible), polypeptide 6; 
xenobiotic monooxygenase; f lavoprotein-linked monooxygenase 


|coumarin 7-hydroxylase (EC 1.14.14.-) cytochrome P450 2A6 


[cytochrome P450IIA j 


cytochrome P-450IIA (AA 1 - 489) 


Cytochrome P450 2A6 (CYPIIA6) (Coumarin 7-hydroxylase) (IIA3) 
(CYP2A3) (P450(I)). 


cytochrome P450-2A6 
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cytochrome P450, family 2, subfamily A, polypeptide 7 isoform 
1; cytochrome P450, subfamily IIA (phenobarbital -inducible} , 
polypeptide 7 


[Cytochrome P450 2A7 (CYPIIA7) (P450-IIA4) . j 


(cytochrome- P4 50 2A4 - human | 


| cytochrome P450 - human j 


| cytochrome P450 - human j 


P-450 IIA3 protein (1 is 3rd base in codon) 


cytochrome P450, family 2, subfamily S, polypeptide 1; 
cytochrome P450 family member predicted from ESTs; cytochrome 
P540, subfamily IIS , polypeptide 1; cytochrome P450, subfamily 
IIS, polypeptide 1 


cytochrome P-450 2C 


unnamed protein product 


[cytochrome P450 - human j 


cytochrome P450, family 2, subfamily A, polypeptide 7 isoform 
2; cytochrome P450, subfamily IIA (phenobarbital - inducible) , 
polypeptide 7 


cytochrome P450, family 2, subfamily J, polypeptide 2; 
cytochrome P450, subfamily IIJ (arachidonic acid epoxygenase) 
polypeptide 2; microsomal monooxygenase; f lavoprotein-linked 
monooxygenase 


cytochrome P450 2J2 


cytochrome P450-IIB 
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cytochrome P450 IID6 


cytochrome P450, subfamily IID, polypeptide 6; debrisoquine 
4 -hydroxylase; microsomal monooxygenase; xenobiotic 
monooxygenase ; 

f lavoprotein-linked monooxygenase; cytochrome P450, 
subfamily IID (debrisoquine, sparteine, etc., 
-metabolizing) -like 1 


S-mephenytoin 4 -hydroxylase 


cytochrome P-450 HPH (12 0 AA) 


Similar to hypothetical protein FLJ20359 


debrisoquine 4 -hydroxylase 


hypothetical protein FLJ20359 


cytochrome P450dbl 


cytochrome P450 epoxygenase 


(cytochrome P450 2A3 , hepatic - human ; 


cytochrome P-450 2C 


similar to cytochrome P450, family 2, subfamily r,. polypeptide 
1; cytochrome P450, 2rl 


cytochrome P450 CYPIBI 
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cytochrome P450, family 1, subfamily B, polypeptide 1; aryl 
hydrocarbon hydroxylase; cytochrome P450, subfamily I 
(dioxin-inducible), polypeptide 1 (glaucoma 3, primary 
infantile); microsomal monooxygenase; xenobiotic monooxygenase; 
flavoprotein-linked monooxygenase 


Similar to cytochrome P450, subfamily IIC (mephenytoin 
4 -hydroxylase) , polypeptide 9 


S-mephenytoin 4 -hydroxylase 


cytochrome P(l)-450 


cytochrome P450, family 1, subfamily A, polypeptide 1; aryl 
hydrocarbon hydroxylase; cytochrome P450, subfamily I (aromatic 
compound- inducible ) , polypeptide 1; flavoprotein-linked 
monooxygenase; cytochrome Pl-450, dioxin-inducible; P450 form 
6; xenobiotic monooxygenase; microsomal monooxygenase 


cytochrome P-450-1 




ATP-binding cassette, sub-family B, member 9 isoform 1 


ATP-binding cassette, sub-family B, member 9 isoform 2 


Similar to ATP-binding cassette, sub-family B (MDR/TAP) , member 
9 


unnamed protein product 


KIAA1520 protein 
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ABC -transporter 


TAP2B 


peptide transporter 


peptide transporter 


transporter 2, ATP-binding cassette, sub-family B isoform 1; 
transporter 2, ABC (ATP binding cassette); ATP-binding 
cassette, sub-family B (MDR/TAP), member 3; antigen peptide 
transporter 2; peptide supply factor 2; peptide transporter 
PSF2; ABC transporter, MHC 2 


peptide transporter 


Antigen peptide transporter 1 (APTl) (Peptide transporter TAPl) 
(Peptide transporter PSFl) (Peptide supply factor 1) (PSF-1) 
(Peptide transporter involved in antigen processing 1) . 


transporter 1, ATP-binding cassette, sub-family B; ATP-binding 
cassette, sub-family B (MDR/TAP) , member 2; antigen peptide 
transporter 1; ATP-binding cassette, sub-family B, member 2; 
transporter, ATP-binding cassette, major histocompatibility 
complex, 1; ABC transporter, MHC 1; peptide supply factor 1 


ABC transport protein 


TAP2E 


Antigen peptide transporter 2 (APT2) (Peptide transporter TAP2) 
(Peptide transporter PSF2) (Peptide supply factor 2) 
(PSF-2) (Peptide transporter involved in antigen processing 2) . 
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transporter 2 isoform 


transporter 2, ATP-binding cassette, sub-family B isoform 2; 
transporter 2, ABC (ATP binding cassette); ATP-binding 
cassette, sub-family B (MDR/TAP), member 3; antigen peptide 
transporter 2; peptide supply factor 2; peptide transporter 
PSF2 ; ABC transporter, 
MHC 2 


TAP 2 protein 


TAP 2 protein 


ATP-binding cassette, sub-family B, member 10 


mono ATP-binding cassette protein 


unnamed protein product 


unnamed protein product 


ATP-binding cassette, sub-family B, member 8; mitochondrial ABC 
protein 


ATP-binding cassette, sub-family B, member 8, mitochondrial 
precursor (Mitochondrial ATP-binding cassette 1) (M-ABCl) . 


P -glycoprotein 


hypothetical protein 
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ATP-binding cassette, sub-fatnily B (MDR/TAP), member 11; ABC 
memberl6, MDR/TAP subfamily; progressive familial intrahepatic 
cholestasis 2; bile salt export pump 


Multidrug resistance protein 1 (P-glycoprotein 1) (CD243 
antigen) . 


ATP-binding cassette, subfamily B, member 4 isoform A; P 
glycoprotein 3/multiple drug resistance 3; j 
P-glycoprotein-3/multiple drug resistance-3; multiple drug 
resistance 3 


ATP-binding cassette, subfamily B, member 4 isoform B; P 
glycoprotein 3/multiple drug resistance 3; 
P-glycoprotein-3/multiple drug resistance-3; multiple drug 
resistance 3 


ATP-binding cassette, subfamily B, member 4 isoform C; P 
glycoprotein 3/multiple drug resistance 3; 
P-glycoprotein-3/multiple drug resistance-3; multiple drug 
resistance 3 


Bile salt export pump (ATP-binding cassette, sub-family B, 
member 11) . 


bile salt export pump 


ATP-binding cassette, sub-family B (MDR/TAP), member 11; ABC 
member 16, MDR/TAP subfamily; progressive familial intrahepatic 
cholestasis 2; bile salt export pump 


TAP 2 protein 


Y3 


|MHC II Y3 gene 
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| Similar to zinc finger protein 268 | 




nuclear factor (erythroid-derived 2)-like 2 


| transcription factor Nrf2 1 


|FLJ00380 protein \ 


| nuclear factor (erythroid-derived 2)-like 1; transcription factor 1 1 (basic leucine zipper type) | 


| transcription factor Nrfl S 


| transcription factor LCR-F 1 . j 




connective tissue growth factor 


| bA69I8. 1 (connective tissue growth factor) 


| connective tissue growth factor j 


| nephroblastoma overexpressed gene j 


jnov precursor 


|CYR61 protein 


CYR61 protein precursor (Cysteine-rich, angiogenic inducer, 61) (Insulin-like growth 
factor-binding protein 10) (GIGl protein). 
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|Cyr61 protein 


WNTl inducible signaling pathway protein 1 isoform 1 precursor; wnt-1 signaling pathway 
protein 1; Wntl signaling pathway protein 1; Wnt-1 inducible signaling pathway protein 1 


| tumor RMS cell line RD specific product j 


WNTl inducible signaling pathway protein 3 isoform 2; wnt-1 signaling pathway protein 3; 
Wntl signaling pathway protein 3; Wnt-1 inducible signaling pathway protein 3 


WNTl inducible signaling pathway protein 3 isoform 1; wnt-1 signaling pathway protein 3; 
Wntl signaling pathway protein 3; Wnt-1 inducible signaling pathway protein 3 
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WNTl inducible signaling pathway protein 2 precursor; wnt-1 signaling pathway protein 2; 
connective tissue growth factor-like protein 




fibulin 5 precursor; urine p50 protein; developmental arteries 
and neural crest epidermal growth factor-like 


UP50 


unnamed protein product 


fibulin-4 


EGF-containing fibulin-like extracellular matrix protein 2 
precursor (Fibulin-4) (FIBL-4) (UPHl protein) . 


EGF-containing fibulin-like extracellular matrix protein 2; 
fibulin 4 


fibulin-like extracellular matrix protein 


PHI 


EGF-containing fibulin-like extracellular matrix protein 1 
isoform a precursor; f ibrillin-like 


|extracellular protein - human | 


EGF-containing fibulin-like extracellular matrix protein 1 
isoform b; f ibrillin-like 


hypothetical protein 


fibulin 2 
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Monocarboxylate transporter 2 (MCT 2). 


solute carrier family 16 (monocarboxylic acid transporters), member /; monocarboxylate 

transporter 2 , . 

monocarboxylate transporter 2; MCT2 
monocarboxylate transporter isoform 1 


solute carrier family 16 (monocarboxylic acid transporters), memberl 
monocarboxylate transporter 1 - human 

solute carrier family 16 (monocarboxylic acid transporters), member 3; monocarboxylate 
transporter 3 

I Monocarboxylate transporter 3 (MCT 3). 


solute carrier 16 (monocarboxylic acid transporters), member 8; monocarboxylate transporter 3 
solute carrier family 16 (monocarboxylic acid transporters), member 5; monocarboxylate 
transporter 5 
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5' nucleotidase, ecto; Purine 5-Prime-Nucleotidase; 5' nucleotidase (CD73); 
ecto-5'-nucleotidase 


|NT5E protein 


jectf)-5'-nucleotidase ' 
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dermatopontiri precursor - human 


Dermatopontin precursor (Tyrosine-rich acidic matrix protein) (TRAMP). | 


dermatopontin i 




similar to Amyloid beta A4 precursor protein-binding family B member 2 (Fe65-like 
protein) 


(Amyloid beta A4 precursor protein-binding family B member 2 (Fe65-like protein) | 


Similar to amyloid beta (A4) precursor protein-binding, family B, member 2 (Fe65-like 
protein) 


amyloid beta A4 precursor protein-binding, family B, member 1 isoform delta E9; 
amyloid beta A4 precursor protein-binding, family B, member 1; stat-iike protein; 
adaptor protein FE65a2 


amyloid beta A4 precursor protein-binding, family B, member 1 isoform E9; amyloid 
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(antithrombin) , member 1; antithrombin III 
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serine (or cysteine) proteinase inhibitor, clade A (alpha-1 
antiproteinase, antitrypsin) , member 5; Protein C inhibitor 
(plasminogen activator inhibitor-3) ; protein C inhibitor; 
protein C inhibitor (plasminogen activator inhibitor III) 
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solute carrier family 5 (low affinity glucose cotransporter), member 4; solute carrier 
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glucose cotransporter 


Na glucose cotransporter j 


solute carrier family 5 (sodium/glucose cotransporter), member 2; solute carrier family 
5 (sodium/glucose transporter), member 2 
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alpha 1 type I collagen preproprotein; Collagen I, alpha-1 
polypeptide; osteogenesis imperfecta type IV; collagen of skin, 
tendon and hone, alpha-1 chain 


alpha 1 (I) chain propeptide 


Unknown (protein for MGC: 33668) 


collagen alpha-1 (II) 


alpha 1 type III collagen; Collagen III, alpha-1 
polypeptide; collagen, fetal 


jcollagen alpha l(III) chain precursor j 


alphal (III) collagen 1 


pro- alpha (V) collagen (AA 1099) 


alpha 2 type V collagen preproprotein; Collagen V, alpha-2 
polypeptide; AB collagen; collagen, fetal membrane, A 
polypeptide 


type V preprocollagen alpha 2 chain ! 


Similar to collagen, type III, alpha 1 (Ehlers-Danlos syndrome 
type IV, autosomal dominant) 


pro-alpha 1 (II) collagen (313 AA; AA 975-271c) 


Similar to collagen, type V, alpha 2 
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alpha-l type III collagen 


type I collagen pro alpha 1(1) chain propeptide 


carboxy-propeptide of alpha 1 (III) procollagen 


C -terminal propeptide domain 


alpha-2 type V collagen 


prepro-alpha-2 chain 


|Collagen alpha 1(XI) chain precursor j 


jcollagen alpha 1(XI) chain precursor j 


collagen type XI alpha-a isoform B 


alpha 1 type XI collagen isoform B preproprotein; collagen XI, 
alpha-l polypeptide 


collagen type XI alpha-l 


alpha 1 type XI collagen isoform C preproprotein; collagen XI, 
alpha-l polypeptide 


collagen type XI alpha-l isoform A 


alpha 1 type XI collagen isoform A preproprotein; collagen XI, 
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alpha 1 type V collagen preproprotein 
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C0L5A1 protein 


Collagen alpha 1 (V) chain precursor j 


collagen alpha 1 (V) chain precursor j 




chromosome 1 1 open reading frame 24 


jchromosome 1 1 open reading frame 24 j 






inducible T-cell co-stimulator; activation-inducible lymphocyte immunomediatory 
molecule; inducible costimulator 


| Similar to inducible T-cell co-stimulator j 


heat shock 70kDa protein 1 A; heat shock 70kD protein 1A; heat shock-induced 
protein; dnaK-type molecular chaperone HSP70-1 


| Heat shock 70 kDa protein 1 (HSP70.1 ) (HSP70-1 /HSP70-2) j 


| heat shock 70kDa protein 1B; heat shock 70kD protein 1B | 


| dnaK-type molecular chaperone HSPA1 L | 


| heat shock 70kD protein 1-like j 


| Heat shock protein 70 testis variant j 


| heat shock 70kDa protein 1-like; Heat-shock 70kD protein-like-1 \ 


| similar to heat shock protein j 


| heat shock 70kD protein 1-like I 


| heat shock 70kDa protein 2; heat shock 70kD protein 2; Heat-shock 70kD protein-2 j 


| Unknown (protein for MGC:33922) j 


| Heat shock protein 70 testis variant j 


| Similar to heat shock cognate 71 -kd protein j 


| heat shock protein j 
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heat shock 70kDa protein 6 (HSP70B') | 


heat shock 70kDa protein 6 (HSP70B 1 ); heat shock 70kD protein 6 (HSP70B 1 ); 
Heat-shock 70kD protein-6 (HSP70B 1 ) 


annexin A2; annexin II; lipocortin II; Annexin II (lipocortin 1); calpactin 1, heavy 
polypeptide (p36); annexin II (lipocortin II; calpactin 1, heavy polypeptide); annexin II 
(lipocortin II) 


|annexin A2 j 


1 annexin A2 j 


bA255A1 1 .8 (novel protein similar to annexin A2 (ANXA2) (lipocortin II, calpactin I 
heavy chain, chromobindin 8, PAP-IV)) 


| annexin I; annexin I (lipocortin I); lipocortin I J 


| Annexin I j 


annexin IV; annexin IV (placental anticoagulant protein II); placental anticoagulant 
protein II 


Annexin A4 (Annexin IV) (Lipocortin IV) (Endonexin 1) (Chromobindin 4) (Protein II) 
(P32.5) (Placental anticoagulant protein II) (PAP-II) (PP4-X) (35-beta calcimedin) 
(Carbohydrate-binding protein P33/P41) (P33/41) 


| protein PP4-X j 


|annexin A1 1 ; annexin XI; autoantigen, 56-kD; calcyclin-associated annexin 50 


|annexin VI isoform 2; annexin VI (p68); calcium-binding protein p68; calphobindin II; | 


|protein p68 (1 - 673) 


Annexin Vi; Chain: A; Synonym: Lipocortin Vi, P68, P70, Protein III, Chromobindin 20, 
67 Kda Calelectrin, Calphobindin-ll, Cpb- II 


annexin VI isoform 1; annexin VI (p68); calcium-binding protein p68; calphobindin II; 
calelectrin F498 


Annexin VI (Lipocortin VI) (P68) (P70) (Protein III) (Chromobindin 20) (67 kDa 
calelectrin) (Calphobindin-ll) (CPB-II) 


|Annexin III j 
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annexin A3; Annexin III (lipocortin III); annexin III (lipocortin III, 

1 ,2-cyclic-inositol-phosphate phosphodiesterase, placental anticoagulant protein III, 

calcimedin 35-alpha); calcimedin 35-alpha 


Annexin V (Lipocortin V, Endonexin II, Placental Anticoagulant Protein) (Calcium Ions 
Are Visible) Mutation With Glu 17. Replaced By Gly (E17G) 


annexin V; endonexin II; anchorin CM; lipocortin V; placental anticoagulant protein I j 


Annexin V J 


Annexin V (Lipocortin V, Endonexin II, Placental Anticoagulant Protein) Mutant With. 
Glu 17 Replaced By Gly, Glu 78 Replaced By Gin (E17G, E78Q) Complexed With 
Calcium 


[annexin A5 j 


Annexin V (Lipocortin V, Endonexin II, Placental Anticoagulant Protein) (Calciumjons 
Are Visible) Mutant With Glu 78 Replaced By Gin (E78Q) (Second Crystal Form) 


jannexin VIII -human ] 


jannexin VIII; Annexin VII j 


Annexin V; Chain: Null; Engineered: Yes; Mutation: P13, P87, P119, P163, and P248 
Substituted With Thioproline (Prs); BiologicalJJnit: Monomer 


Isimilar to annexin A8 ' 


|anextn VIII 


jannexin VII isoform 2; annexin VII (synexin); synexin 


jannexin VII isoform 1; annexin VII (synexin); synexin 


|similar to annexin A8 


| annexin A13 isoform b 


|annexin A13; annexin XIII; annexin, intestine-specific 


| Unknown (protein for MGC:1925) 


| keratinocyte annexin-like protein 


Jannexin 31 ; annexin XXXI 
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intestinal fatty acid binding protein 2; Fatty acid-binding protein, intestinal; l-FABP; fatty 
acid binding protein 2, intestinal 


Intestinal Fatty Acid Binding Protein; Chain: A; Synonym: l-Fabp ( 


cytochrome P450, subfamily XVII polypeptide; steroid 17-alpha-monooxygenase; 
steroid 17-alpha-hydroxylase/17,20 lyase; cytochrome p450 XVIIA1 


cytochrome P450c1 7 j 


| steroid 1 7-alpha-hydroxylase | 


cytochrome P(1 )-450 ] 


cytochrome P450, subfamily I (aromatic compound-inducible), polypeptide 1 ; 
flavoprotein-linked monooxygenase; cytochrome P1-450, dioxin-inducible; aryl 
hydrocarbon hydroxylase; P450 form 6; xenobiotic monooxygenase; microsomal 
monooxygenase 


(cytochrome P-450-1 j 


| cytochrome P-450-1 | 


cytochrome P450, subfamily I (aromatic compound-inducible), polypeptide 2; 
dioxin-inducible P3-450; P450 form 4; microsomal monooxygenase; xenobiotic 
monooxygenase; aryl hydrocarbon hydroxylase; flavoprotein-linked monooxygenase 


[cytochrome P450-1A2 


| cytochrome P450 4 


|Cytochrome P450 XXIB (Steroid 21 -hydroxylase) (P450-C21B) 


| CYP21B protein 


cytochrome P450, subfamily XXIA polypeptide 2; steroid 21 -monooxygenase; steroid 
21 -hydroxylase 


1 21 -hydroxylase B 


| mutant 21 -hydroxylase B 


| steroid 21 -monooxygenase 
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aldehyde dehydrogenase [NAD(P)] (EC 1.2.1 .5) 3 - 


aldehyde dehydrogenase 3 I 


unnamed protein product j 


aldehyde dehydrogenase 3B1 ; aldehyde dehydrogenase 7; aldehyde dehydrogenase 3 
family, member B1 


Similar to aldehyde dehydrogenase 3 family, member B1 j 


aldehyde dehydrogenase 3B2; aldehyde dehydrogenase 8; aldehyde dehydrogenase 3 
family, member B2 


i Similar to aldehyde dehydrogenase 3 family, member B2 j 


| unnamed protein product | 




homocysteine-inducible, endoplasmic reticulum stress-inducible, ubiquitin-like domain 
member 1; MMS-inducible gene 


| unknown j 


| unknown j 


Similar to homocysteine-inducible, endoplasmic reticulum stress-inducible, 
ubiquitin-like domain member 1 


| hypothetical protein FLJ223 1 3 


TLS-CHOP 


|chimeric cDNAfrom Myxoid liposarcoma 


inducible transcript 3; C/EBP homologous protein; growth arrest- and 
nducible 


inducible protein GADD153 - human 


DNA-damage- 
DNA damage-i 


|DNA-damage- 
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Type I iodothyronine deiodinase (Type-I 5'deiodinase) (DIOI) (Type 1 Dl) (5DI) 


jthyroxine deiodinase type 1 ; 5DI; thyroxine deiodinase type I (selenoprotein) j 


(Similar to deiodinase, iodothyronine, type I j 


hypothetical protein MGC4504 


sialyltransferase 9 (CMP-NeuAc:lactosylceramide alpha-2,3-sialyltransferase; GM3 
synthase); ganglioside G(M3) Synthase 


|CMP-NeuAc:lactosylceramide alpha-2,3-sialyltransferase 


| sialyltransferase 6 (N-acetyllacosaminide alpha 2,3-sialyltransferase) 


| Gal beta 1 ,3(4) GlcNAc alpha 2,3-sialyltransferase j 


carnitine acetyltransferase precursor,, isoform 1 


| Carnitine O-acetyltransferase (Carnitine acetylase) (CAT) 


| carnitine acetyltransferase | 


| carnitine acetyltransferase isoform 2 ] 


| carnitine acetyltransferase precursor, isoform 3 j 


| Choline O-acetyltransferase (CHOACTase) (Choline acetylase) (ChAT) j 


isoform S ' 


isoform 2; acetyl CoAxholine O-acetyltransferase | 


soform R ! 


soform 1 ; acetyl CoA:choline O-acetyltransferase j 


choline acetyltransferase - human (fragment). j 


| choline acetyltransferase i 


jcholine acetyltransferase i 


jcholine acetyltransferase i 


[choline acetyltransferase i 
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X-ray repair cross complementing protein 2; X-ray repair, complementing defective, 
repair in Chinese hamster; DNA repair protein XRCC2 


elongation of very long chain fatty acids (FEN1/Elo2, SUR4/EI03, yeast)-HKe 2 


dJ483K1 6.1.1 (novel protein (isoform 1)) _ 1 


homoloq of yeast long chain polyunsaturated fatty acid elongation J 


elongation of very long chain fatty acids (FEN1/Elo2, SUR4/Elo3, yeast)-like 4; 
Stargardt disease 3 (autosomal dominant) 


jdJ92C4.1 (novel protein, partly predicted by Fgenesh and Genscan) I 


CD3Z antigen, zeta polypeptide (TiT3 complex) 


|T-cell surface glycoprotein CD3 zeta chain precursor (T-cell receptor T3 zeta chain) 1 


|T-cell receptor zeta chain precursor j 


|T-cell receptor zeta chain precursor | 


fatty acid desaturase 2; delta-6 fatty acid desaturase; linoleoyl-CoA desaturase 
(delta-6-desaturase)-like 2 


1 fatty acid desaturase 2 
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Iprobable delta-6 fatty acid desaturase (EC 1 .14.99.-) - human (fragment). 


fatty acid desaturase 3; delta-6 fatty acid desaturase; linoleoyl-CoA desaturase 
(delta-6-desaturase)-like 3 


I unnamed protein product 


I unnamed protein product 


fatty acid desaturase 1 ; linoleoyl-CoA desaturase (delta-6-desaturase)-like 1 ; delta-5 
desaturase; delta-5 fatty acid desaturase 


| fatty acid desaturase 1 


I delta-5 fatty acid desaturase 


| unnamed protein product 
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delta-5 desaturase I 


unnamed protein product j 


|BC269730_1 j 


unnamed protein product j 


VDUP1 


|thioredoxin interacting protein; upregulated by 1,25-dihydroxyvitamin D-3 j 


| similar to RIKEN cDNA 2410003C09 gene j 
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| KIAA1 376 protein | 


| similar to hypothetical protein CLONE24945 j 
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differentiation gene 1; DKFZP564F1862 protein; endoplasmic reticulum DnaJ homolog 
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(PID:g1771560) 
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platelet-derived growth factor receptor beta precursor; beta platelet-derived growth 
factor receptor 


| platelet-derived growth factor receptor 


| platelet-derived growth factor receptor, beta polypeptide ! 


| KIT protein j 


j v-kit Hardy-Zuckerman 4 feline sarcoma viral oncogene homolog precursor 


colony stimulating factor 1 receptor, formerly McDonough feline sarcoma viral (v-fms) 
oncogene homolog 


Macrophage colony stimulating factor I receptor precursor (CSF-1-R) (Fms 
proto-oncogene) (c-fms) (CD1 1 5 antigen) 


| FLT3 receptor tyrosine kinase I 


jfms-related tyrosine kinase 3 


| protein-tyrosine kinase (EC 2.7.1.112) STK-1 precursor - human 


| Unknown (protein for MGC:1451 9) | 


| vascular endothelial growth factor receptor 


fms-related tyrosine kinase 1 (vascular endothelial growth factor/vascular permeability 
factor receptor) 


fms-related tyrosine kinase 4; fms-related tyrosine kinase-4 (vascular endothelial 
growth factor receptor 3) 
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Vascular endothelial growth factor receptor 3 precursor (VEGFR-3) (Tyrosine-protein 
kinase receptor FLT4) 


| vascular endothelial growth factor receptor 2 j 


kinase insert domain receptor (a type III receptor tyrosine kinase); Kinase insert 
domain receptor 


|protein-tyrosine kinase (EC 2.7.1 .112) KDR - human ! 


| receptor tyrosine kinase - human (fragment). ] 
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adrenoieukodystrophy related protein 


ATP-binding cassette, sub-family D, member 2; adrenoleukodystrophy-like 1 ; hALDR | 


ATP-binding cassette, sub-family D (ALD), member 1 ; adrenoieukodystrophy protein I 


adrenoieukodystrophy protein f 


Adrenoieukodystrophy protein (ALDP) ! 


70-kd peroxisomal membrance protein homolog {internal fragment} [human, Peptide 
Partial, 386 aa] 


adrenoieukodystrophy related protein ] 


ATP-binding cassette, sub-family D, member 3; Peroxisomal membrane protein-1 
(70kD); peroxisomal membrane protein 1 (70kD, Zellweger syndrome); peroxisomal 
membrane protein-1 


peroxisomal membrane protein, 70K - human j 


70kD peroxisomal integral membrane protein 


SWI/SNF-related matrix-associated actin-dependent regulator of chromatin a-like 1; 
HepA-related protein; SMARCA-like protein 1 


SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamilvl 


a-like 1 


HepA-related protein HARP 


hypothetical protein DKFZp434B1050.1 - human (fragment) 


unnamed protein product 


unnamed protein product 


Similar to RIKEN cDNA 1700018018 gene 


unnamed protein product 


(Similar to RIKEN cDNA 1700018018 gene 


(similar to F16H1 1.1.p 


dynein, cytoplasmic, light polypeptide; Dynein, cytoplasmic, light chain (protein inhibitor 
of neuronal NOS); protein inhibitor of neuronal nitric oxide synthase 
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sema domain, seven thrombospondin repeats (type 1 and type Mike), transmembrane 
domain (TM) and short cytoplasmic domain, (semaphorin) 5A; semaphorin F; sema 
domain, seven thrombospondin repeats (type 1 and type 1-like), transmembrane 
domain (TM) and short cytoplasmic domain, 5A 


KIAA1 445 protein I 


similar to KIAA1445 protein 


semaphorin F I 


hypothetical protein MGC2605 


|similartoHAGH | 


hydroxyacyl glutathione hydrolase; hydroxyacyl glutathione hydrolase; glyoxalase 2; 
Hydroxyacyl glutathione hydrolase; glyoxalase II; hydroxyacylglutathione hydroxylase 


| unnamed protein product I 


TBP-associated factor 6 isoform alpha; TAF6 RNA polymerase II, TATA box binding 
protein (TBP)-associated factor, 80 kD; TATA box binding protein (TBP)-associated 
factor, RNA polymerase II, E, 70/85kD; transcription initiation factor TFIID 70 kD 
subunit 


TBP-associated factor 6 isoform gamma; TAF6 RNA polymerase II, TATA box binding 
protein (TBP)-associated factor, 80 kD; TATA box binding protein (TBP)-associated 
factor, RNA polymerase II, E, 70/85kD; transcription initiation factor TFIID 70 kD 
subunit 


TBP-associated factor 6 isoform delta; TAF6 RNA polymerase II, TATA box binding 
protein (TBP)-associated factor, 80 kD; TATA box binding protein (TBP)-associated 
factor, RNA polymerase II, E, 70/85kD; transcription initiation factor TFIID 70 kD 
subunit 
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Unknown (protein for MGC:21 163) 


protein kinase I 


serum/glucocorticoid regulated kinase-like; cytokine-independent survival kinase f 


protein kinase I 


serum/glucocorticoid regulated kinase 2 isoform beta I 


v-akt murine thymoma viral oncogene homolog 3 (protein kinase B, gamma); protein 
kinase B 


monoglyceride lipase; lysophospholipase-like; likely ortholog of mouse monoglyceride 
lipase 


| monoglyceride lipase I 


neutral amino acid transporter 


solute carrier family 1 (glutamate/neutral amino acid transporter), member 4; Solute 
carrier family 1 (glutamate/neutral amino acid transporter), 


| neutral amino acid transporter - human 


I hASCTI 


| Na+-dependent neutral amino acid transporter SATT 


solute carrier family 1 (neutral amino acid transporter), member 5; baboon M7 virus 
receptor; RD1 14 virus receptor; neutral amino acid transporter B 


| neutral amino acid transporter 


jsolute carrier family 1 (neutral amino acid transporter), member 5 


| sodium-dependent neutral amino acid transporter type 2 truncated isoform 


| neutral amino acid transporter B 


|RD114/simian type D retrovirus receptor 


membrane-associated guanylate kinase-related 3 


similar to membrane-associated guanylate kinase MAGI3 
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CCAAT/enhancer binding protein (C/EBP), delta 


transcription activator NF-IL6 beta - human ! 


similar to CCAAT/enhancer binding protein delta (C/EBP delta) (Nuclear factor 
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transforming growth factor beta-stimulated protein TSC-22 


| cerebral protein-2 | 


BCR downstream signaling 1 


Solute carrier family 12 member 3 (Thiazide-sensitive sodium-chloride cotransporter) 
(Na-CI symporter) 


|NaCI electroneutral Thiazide-sensitive cotransporter j 


solute carrier family 12 (sodium/chloride transporters), member 3; Solute carrier family 
12 (sodium/pptassium/chloride transporters), 


solute carrier family 12 (sodium/potassium/chloride transporters), member 2; Solute 
carrier family 12 (sodium/potassium/chloride transporters), 


sodium potassium chloride cotransporter 2; Solute carrier family 12 
(sodium/potassium/chloride transporters), 


jlar to solute carrier family 12 (sodium/potassium/chloride transporters), member 2 


|thiazide-sensitive sodium-chloride cotransporter - human (fragment) ! 


solute carrier family 12 (potassium/chloride transporters), member 7; 
potassium/chloride transporter KCC4 
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diacylglycerol O-acyltransferase 2 like 1; diacylglycerol acyltransferase 2-like 


hypothetical protein . . 


Unknown (protein for MGC:1 7861 ) I 
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hypothetical protein FLJ22644 | 
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ubiquitin-conjugating enzyme E2C; ubiquitin carrier protein E2-C 


Ubiquitin-Conjugating Enzyme E2 H10; Chain: A, B; Synonym: Ubiquitin-Conjugating 
Enzyme UbchIO; Ec: 6.3.2.19; 


|dJ447F3.2.4 (ubiquitin-conjugating enzyme E2 H10 (isoform 4)) j 


CYR61 protein 


CYR61 protein precursor (Cysteine-rich, angiogenic inducer, 61) (Insulin-like growth 
factor-binding protein 10) (GIG1 protein) 
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cysteine-rich, angiogenic inducer, 61 ; cysteine-rich heparin-binding protein 61 ; 
cysteine-rich, anigogenic inducer, 61 
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WNT1 inducible signaling pathway protein 3, isoform 2; Wnt1 signaling pathway, 
protein 3; lost in inflammatory breast cancer tumor suppressor protein; connective 
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serine (or cysteine) proteinase inhibitor, clade A (alpha-1 antiproteinase, antitrypsin), 
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Chain A, Crystal Structure Of P681h Mutant Of Tir Domain Of Human Tlr2. | 
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cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), polypeptide 19; 
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Leukotriene B4 omega-hydroxylase \ 


tochrome P450, subfamily IVF, polypeptide 2; leukotriene B4 omega-hydroxylase; 
leukotriene-B4 20-monooxygenase 


cytochrome P450, subfamily IVF, polypeptide 3; leukotriene B4 omega hydroxylase; 
leukotriene-B4 20-monooxygenase; cytochrome P450-l_TB-omega 


cytochrome P450 4F2 i 


cytochrome P-450 j 


jCytochrome P450 4F11 (CYPIVF11) | 


cytochrome P450, subfamily IVF, polypeptide 1 1 | 


| Cytochrome P450 4F12 (CYPIVF12) \ 


cytochrome P450 isoform 4F12 j 


| similar to cytochrome P450 j 


|F22329_1 ! 


cytochrome P450, subfamily IVF, polypeptide 8; microsomal monooxygenase; 
flavoprotein-linked monooxygenase 


| hypothetical protein j 


Isimilar to CYTOCHROME P450 4F6 (CYPIVF6) ! 


Isimilar to Cytochrome P450 4F12 (CYPIVF12) | 


|Similar to cytochrome P450, subfamily IVA, polypeptide 1 1 


Isimilar to Cytochrome P450 4F12 (CYPIVF12) 


| unnamed protein product 
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[cytochrome P450 


| truncated cytochrome P450 


Glucocorticoid-induced leucine zipper protein (Delta sleep-inducing peptide 
immunoreactor) (DSIP-immunoreactive peptide) (DIP protein) (hDIP) (TSC-22-like 
protein) (TSC-22R) 


(hypothetical protein DKFZp566A093.1 , human j 
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|vasopressin receptor type 2 


| vasopressin receptor:ISOTYPE=V2 


| arginine vasopressin receptor 2 


| oxytocin receptor 


| oxytocin receptor 


| oxytocin receptor 


arginine vasopressin receptor 1B; arginine vasopressin receptor 3; antidiuretic 
hormone receptor 1B; vasopressin V1B receptor; pituitary vasopressin receptor 3 


arginine vasopressin receptor 1A; Via vasopressin receptor; vascular/hepatic-type 
arginine vasopressin receptor; antidiuretic hormone receptor 1A 


| C-reactive protein, pentraxin-related; C-reactive protein I 


IC-Reactive Protein; Chain: A, B, C, D, E, F, G, H, I, J | 


| similar to C-reactive protein precursor j 


| Serum Amylo d P Component (Sap) ! 


serum amyloid P component precursor; amyloid P component, serum; 
pentaxin-related; 9.5S alpha-1 -glycoprotein 


pre-serum amyloid P component 
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nucleolar autoantigen (55kD) similar to rat synaptonemal complex j 


cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), polypeptide 18; 
cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), polypeptide 17; 
microsomal monooxygenase; flavoprotein-linked monooxygenase 


| Cytochrome P450 2C18 (CYPIIC18) (P450-6B/29C) | 


cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), polypeptide 19; 
mephenytoin 4'-hydroxylase; microsomal monooxygenase; xenobiotic 
monooxygenase; flavoprotein-linked monooxygenase 


Cytochrome P450 2C8 (CYPIIC8) (P450 form 1) (P450 MP-12/MP-20) (P450 IIC2) 
(S-mephenytoin 4-hydroxylase) 


| Unknown (protein for MGC:22146) j 


cytochrome P450, subfamily IIC, polypeptide 9; cytochrome P450, subfamily IIC 
(mephenytoin 4-hydroxylase), polypeptide 10; mephenytoin 4-hydroxylase; microsomal 
monooxygenase; xenobiotic monooxygenase; flavoprotein-linked monooxygenase 


jcytochrome P-450 [Homo sapiens] j 


jcytochrome P-450 S-mephenytoin 4-hydroxylase j 


jcytochrome P-450 [Homo sapiens] j 


cytochrome P450, subfamily IIC, polypeptide 8 isoform 1; mephenytoin 4-hydroxylase; 
microsomal monooxygenase; xenobiotic monooxygenase; flavoprotein-linked 
monooxygenase; P450 form 1 


jcytochrome P450 2C8 - human. i 


cytochrome P450 arachidonic acid epoxygenase isoform, Cyp 2C8 [human, kidney, 
Peptide Partial, 485 aa] 


jcytochrome P-450 S-mephenytoin 4-hydroxylase 


| S-mephenytoin 4'-hydroxylase (EC 1.14.14.-) cytochrome P450 2C19 


Cytochrome P450 2C10 (CYPIIC10) (P450 MP-8) (S-mephenytoin 4-hydroxylase) 
(P-450MP) 
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jSimilar to lipin 1 


j lipin 1 i 


Lipin 1 


Cytochrome P450 7A1 (Cholesterol 7-alpha-monooxygenase) (CYPVII) (Cholesterol 
7-alpha-hydroxylase) 


| cytochrome P450, subfamily VIIB, polypeptide 1 ; oxysterol 7alpha-hydroxylase ! 


oxysterol 7alpha-hydroxylase 


(hypothetical protein TSC501 [imported] | 


| kidney- and liver-specific gene . I 


|GLA | 


| putative N-acetyltransferase Camello 2 j 


N-acetyltransferase 8; kidney- and liver-specific gene product; kidney- and 
liver-specific gene 


|cytochrome P450 - human « 
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cytochrome P450 


cytochrome P450, subfamily IVA, polypeptide 11; fatty acid omega-hydroxylase; 
P450HL-omega; alkane-1 monooxygenase; lauric acid omega-hydroxylase 


|fatty acid omega-hydroxylase (EC 1.14.15.-) cytochrome P450 4A1 1 - human j 


Cytochrome P450 4A11 precursor (CYPIVA1 1) (Fatty acid omega-hydroxylase) (P-450 
HK omega) (Lauric acid omega-hydroxylase) (CYP4AII) (P450-HL-omega) 


| fatty acid omega-hydroxylase j 


| fatty acid omega-hydroxylase CYP4A1 1 j 


|dJ18D14.4 (cytochrome P450, subfamily IVA, polypeptide 11) ] 


| unnamed protein product j 


| Unknown (protein for MGC:40051 ) | 


amylase, alpha 2A; pancreatic; Amylase, pancreatic, alpha-2A 


|amylase, alpha 2B; pancreatic; Amylase, pancreatic, alpha-2B ! 


| similar to Alpha-amylase, salivary precursor (1 ,4-alpha-D-glucan glucanohydrolase) j 


| alpha-amylase (EC 3.2.1.1) precursor, salivary - human 


| amylase, alpha 1A; salivary; Amylase, salivary, alpha-1A 


Chain A, Structure Of Human Pancreatic Alpha-Amylase In Complex With The 
Carbohydrate Inhibitor Acarbose 


Chain , Moljd: 1; Molecule: Human Pancreatic Alpha-Amylase; Chain: Null; Ec: 
3.2.1.1 


Chain A, Three Dimensional Structure Analysis Of The R195q Variant Of Human 
Pancreatic Alpha Amylase 
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|CD36 antigen (collagen type 1 receptor, thrombospondin receptor) j 


cell adhesion receptor CD36 j 


CD36 antigen (collagen type 1 receptor, thrombospondin receptor); CD36 antigen 
(collagen type 1) 


Platelet glycoprotein IV (GPIV) (GPIIIB) (CD36 antigen) (PAS IV) (PAS-4 protein) 


| alpha-amylase 


| Chain A, Role Of Ethe Mobile Loop In The Mehanism Of Human Salivary Amylase 


jChain A, Role Of Mobile Loop In The Mechanism Of Human Salivary Amylase | 


| Chain , Human Salivary Amylase j 


Chain A, Mechanistic Analyses Of Catalysis In Human Pancreatic Alpha- Amylase: 
Detailed Kinetic And Structural Studies Of Mutants Of Three Conserved Carboxylic 
Acids 


Chain A, Three Dimensional Structure Analysis Of The R337a Variant Of Human 
Pancreatic Alpha-Amylase 


Chain A, Three Dimensional Structure Analysis Of The R195a Variant Of Human 
Pancreatic Alpha Amylase 


Chain A, Mechanistic Analyses Of Catalysis In Human Pancreatic Alpha- Amylase: 
Detailed Kinetic And Structural Studies Of Mutants Of Three Conserved Carboxylic 
Acids 


Chain A, Subsite Mapping Of The Active Site Of Human Pancreatic Alpha-Amylase 
Using Substrates, The Pharmacological Inhibitor Acarbose, And An Active Site Variant 


Chain A, Three Dimensional Structure Analysis Of The R337q Variant Of Human 
Pancreatic Alpha-Mylase 
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beta-galactos dase binding lectin precursor; Lectin, galactose-binding, soluble, 1; 
galectin 


G protein-coupled receptor 91 


P2Y purinoceptor 1. 


Chain A, Crystal Structure Of Human Dehydroepiandrosterone Sulfotransferase In 
Complex With Substrate 


| hydroxysteroid sulfotransferase SULT2B1b | 


sulfotransferase family, cytosolic, 2B, member 1; sulfotransferase family 2B, member 
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hydroxysteroid sulfotransferase SULT2B1a 


| MHC class II HLA-DQ-beta-1 I 


| MHC class II HLA-DQ-beta-1 | 


| human leukocyte antigen-DQ beta chain I 


| HLA class II histocompatibility antigen, DX beta chain precursor 


| MHC class II histocompatibility antigen HLA-DQ beta chain (DQ4) precursor - human | 


| MHC class II histocompatibility antigen DQ-beta chain precursor - human 


| lymphocyte antigen 


HLA class II histocompatibility antigen, DQB1*0602 beta chain precursor (DQ(5)) 
(DC-1) 
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dopachrome tautomerase (dopachrome delta-isomerase, tyrosine-related protein 2); 
Dopachrome tautomerase (dopachrome delta-isomerase; tyrosinase-related protein 2) 


pre propeptide (AA -24 to 503) j 


| tyrosinase-related protein 1 | 


|bA3L8.1 (tyrosinase-related protein 1) j 
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|BAC06178.1 


|AAB36682.1 


NP_5 10966.1 


1 137225 


[decay-accelerating factor 3 


decay-accelerating factor 4ab 


| decay-accelerating factor 1 ab j 


Chain R, Structural Model Of Human Decay-Accelerating Factor Bound To Echovirus 
7 From Cryo-Electron Microscopy 


| decay-accelerating factor, splice form 1 precursor - human j 


| decay-acceleration factor I 


| decay-accelerating factor precursor j 


(Complement decay-accelerating factor precursor (CD55 antigen) i 


decay accelerating factor for complement (CD55, Cromer blood group system); 
Decay-accelerating factor of complement 


|R29368_2 j 


jegf-like modu e containing, mucin-like, hormone receptor-li 


jegf-like modu e containing, mucin-like, hormone receptor-li 


| egf-like modu e containing, mucin-like, hormone receptor-l 


jegf-like modu e containing, mucin-like, hormone receptor-l 


jegf-like modu e containing, mucin-like, hormone receptor-l 


egf-like modu e containing, mucin-like, hormone receptor-like sequence 1; egf-like 
module containing, mucin-like, hormone receptor-like 


| Leucocyte an igen CD97 precursor 


| seven transmembrane helix receptor \ 


| seven transmembrane helix receptor . j 


|CD97 | 


CD97 antigen isoform 1 precursor; leukocyte antigen CD97; seven-span 
transmembrane protein 


| leucocyte ant gen CD97 


ike sequence 2 i 


ike sequence 2 i 


ike sequence 2 i 


ike sequence 2 


ike sequence 2 i 


soform f j 


soform g j 


isoform d j 


isoform e ! 


isoform c j 
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PP3774 


|Similar to RIKEN cDNA 1500015N03 gene | 


jsimilar to Abl-philin 2 


| hypothetical protein MGC2993 


B-cell lymphoma 6 protein; B-cell .CLL/lymphoma-6; cys-his2 zinc finger transcription 
factor BCL5; zinc finger protein 51 ; lymphoma-associated zinc finger gene on 
chromosome 3 


| B-cell CLL/lymphoma 6 (BCL6) protein \ 


| BAZF | 


jsimilar to BcL6-associated zinc finger protein j 


hematopoieticaliy expressed homeobox; proline-rich homeodomain-containing 
transcription factor 


| homeobox protein HEX - human j 


[Similar to hematopoieticaliy expressed homeobox j 


| homeobox related protein j 


similar to Glutathione S-transferase Mu 1 (GSTM1-1) (HB subunit 4) (GTH4) 
(GSTM1a-1a) (GSTM1b-1b) (GST class-mu 1) 


Chain A, Ligand-Free Human Glutathione S-Transferase M1a-1a 

glutathione transferase M1 1 


AAG23766.1 


|AAH08074.1 I 


|AAH33157.1 j 


|NP_1 15703.1 ; 


NP_00 1697.2 


|A48752 J 


|BAC00962.1 | 


|XP_171849.1 | 


NP_002720.1 


|JN0767 j 


|AAH14336.1 | 


ICAA79730.1 | 


1 
& 


4388890 
AAA59203.1 


F:(C-HI) 

-1.7, 

F:(C-D) 

-2.35, 

U:(HI-D) 

2.52 








F:(C-D) 
-4.15, 
U:(HI-D) 
2.11 








F:(C-D) 
-2.62, 
U:(HI-D) 
2.05 








F:(C-D) 
-2.27 
U:(C-D) 
2.17 
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NP_076229.1 
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NP_032271.1 
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NP_032209.1 
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NM_008182 
NP_032208.1 


















R(C-HI) 
-4.31, 
F:(C-D) 
-5.26 
















R(C-HI) 
-9.17, 
R(C-D) 
-5.68 






Subclass: cytochrome P450 2C 17 j 


iSubclass: Cytochrome P450 2C8 (CYPIIC8) (P450 form 1) (P450 MP-12/MP-20) (P450 IIC2) (S-mephenytoin 4-hvdroxvlase) I 


Subclass: cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), polypeptide 19; mephenytoin 4-hydroxylase; 
microsomal monooxygenase; xenobiotic monooxygenase; flavoprotein-linked monooxygenase 


| Subclass: Cytochrome P450 2C10 (CYPIIC10) (P450 MP-8) (S-mephenytoin 4-hydroxylase) (P-450MP) [ 


Subclass: cytochrome P450, subfamily IIC, polypeptide 9; cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), 
polypeptide 10; mephenytoin 4-hydroxylase; microsomal monooxygenase; xenobiotic monooxygenase; flavoprotein-linked 
monooxygenase 


Subclass: cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), polypeptide 18; cytochrome P450, subfamily IIC 
(mephenytoin 4-hydroxylase), polypeptide 17; microsomal monooxygenase; flavoprotein-linked monooxygenase 


cytochrome P-450 




I Subclass: 


|Subclass: 


|Subclass; 


jSubclass: 


| Subclass: 


Subclass: glutathione S-transferase A1 ; GST, class alpha, 1 ; glutathione S-alkyltransferase A1 ; glutathione S-aryltransferase 
A1; S-(hydroxyalkyl)glutathione lyase A1; glutathione S-aralkyltransferase A1 ; GST-epsilon; glutathione S-transferase 2 


glutathione transferase 




: glutathione transferase (EC 2.5.1.18) alpha-2 (clone GTH2)- human | 


; glutathione S-transferase A3 j 


; Glutathione S-transferase A3-3 (GST class-alpha) j 


: Chain A, Glutathione S-Transferase A1-1 (E.C.2.5.1.18) 


: TPA: glutathione transferase A5 I 
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AK002480 
NP_666065.1 
















NM_010166 
NP 034296.1 








NM_008988 
NP 033014.1 
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AF294617 
AAG02 118.1 




F:(C-HI) 
-2.55, 
F:(C-D) 
-2.57 
















F:(C-HI) 
-2.57 








F:(C-HI) 
-2.6 




F:(C-HI) 
-2.61, 
F:(C-D) 
-2.33 




F:(C-HI) 
-2.63 




cystathionase isoform 1 ; cystathionine gamma-lyase; homoserine deaminase; homoserine dehydratase; cysteine 
desulfhydrase 




| Subclass: 


|Subclass: 


| Subclass: 


(Subclass: 


| Subclass: 


ISubclass: 


Eyes absent homolog ,• 




| Subclass: similar to punc 


| Subclass: putative neuronal cell adhesion molecule (punc) 


putative neuronal cell adhesion molecule (punc) 




cytochrome P450, subfamily IIB (phenobarbital-inducible), polypeptide 6 

J 




6-phosphofructo-2-kinase/fructose-2,6-biphosphatase 


; EYA1D | 


: Eyes absent homolog 2 I 


: EYA1A ! 


: eyes absent 1 isoform b; Eyes absent, Drosophila, homolog of, 1; Melnick-Fraser syndrome ] 


: eyes absent homolog 4 (Drosophila); j 


Eyes absent homolog 3 (EYA3) I 
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F:(C-D) 
-2.56 


■2.43, 


IR(C-HI) 




R(HI-D) 
-2.32 


F:(C-D) 
-5.64, 


-2.43, 


F:(C-HI) 




















| Subclass 


| Subclass 


| Subclass 


| Subclass 


| Subclass 






I cytochron 








and stero 


1 hydroxy-c 




| Subclass: 


| Subclass: 


Subclass: 


Subclass: 
Zfp93 in r 


Subclass: 


, Subclass: 


Subclass: 




: cytochrome P450 2C8 


: cytochrome P450 2C10 


: cytochrome P450 2C9 


: cytochrome P450 2C19 


: CytochromeP450 2C18(CYPIIC18)(P450-6B/29C) , 
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cn 
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Cl 


lelta-5-steroid dehydrogenase, 3 beta- and steroid delta-isomerase 1; Hydroxy-delta-5-steroid dehydrogenase, 3 beta-J 




: Zinc finger protein ZNF45 I 


: similar to Zinc finger protein 229 I 


: Hypothetical zinc finger-like protein ! 


zinc finger protein 93 homolog; zinc finger protein homologous to mouse Zfp93; zinc finger protein homologous to 
nouse; zinc finger protein 93 homolog (mouse) 


Zinc finger protein 226 ) 


similar to ZNF228 protein I 


zinc finger protein 228 I 
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aldehyde dehydrogenase 1 family 


aldehyde dehydrogenase 1 family, member B1 ; aldehyde dehydrogenase 5 | 


Aldehyde dehydrogenase X, mitochondrial precursor (ALDH class 2) j 


Aldehyde dehydrogenase 1A2 (Retinaldehyde-specific dehydrogenase type 2) (RALDH(II)) (RALDH-2) j 


Aldehyde dehydrogenase 1A1; aldehyde dehydrogenase 1, soluble; aldehyde dehydrogenase, liver cytosolic; ALDH 
cetaldehyde dehydrogenase 1 ; retinal dehydrogenase 1 


Subclass: Aldehyde dehydrogenase 1A3; aldehyde dehydrogenase 6 I 




Similar to RIKEN cDNA 0610025L15 gene product 




histidine ammonia-lyase; Histidine ammonia-lyase (histidase) 




putative N-acetyltransferase Camello 2 


; N-acetyltransferase 8; kidney- and liver-specific gene product; kidney- and liver-specific gene j 


: GLA | 


: kidney- and liver-specific gene product ] 


; hypothetical protein TSC501 | 


Subclass: 


Subclass: 


Subclass: 


Subclass: 
class 1; a 


| Alternate: 


[Alternate: 


| Alternate: 


| Alternate: 


F:(C-HI) 
-2.39, 
F:(C-D) 
-2.05 














F:(C-HI) 
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-2.48 




F:(C-HI) 
-2.39, 
F:(C-D) 
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hypothetical protein dJ726C3.2 




hypothetical protein dJ726C3.2 




ADP-ribosylation factor 


: ADP-ribosylation factor 4-like; ADP-ribosylation factor-like 6 [ 


: similar to ADP-ribosylation-like 4 . [ 


: ADP-ribosylation factor-like 7 j 


: ADP ribosylation factor-like protein j 




NADP-dependent malic enzyme 


Subclass: cytosolic malic enzyme 1; malic enzyme, cytoplasmic; malic enzyme 1, soluble; NADP-dependent malic enzyme; 
malate dehydrogenase; pyruvic-malic carboxylase 


Subclass: malic enzyme 3, NADP(+)-dependent, mitochondrial; malic enzyme, NADP+-dependent, mitochondrial; 
pyruvic-malic carboxylase; malate dehydrogenase; NADP-ME 


Subclass: malic enzyme 2, NAD(+)-dependent, mitochondrial; Malic enzyme, mitochondrial; malic enzyme 2, mitochondrial; 
pyruvic-malic carboxylase; malate dehydrogenase 


| Subclass: 


| Subclass: 


|Subclass: 


| Subclass: 
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NM_026104 
NP_080380.1 
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-2.17, 
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Subclass: Chain A, Glutathione Transferase A1-1 Complexed With An Ethacrynic Acid Glutathione Conjugate (Mutant R15k) 
Subclass: TPA: glutathione transferase A5 


Subclass: glutathione S-transferase A2; glutathione S-transferase 2; GST, class alpha, 2; liver GST2; glutathione 
S-alkyltransferase A2; glutathione S-aryltransferase A2; S-(hydroxyalkyl)glutathione lyase A2; glutathione S-aralkyltransferase 
A2; GST-gamma; HA subunit 2 


Subclass: 
A1;S-(hy 


| Subclass; 


I Subclass: 


Glutathione S-transferase 


I Subclass 


| Subclass 


Subclass 
Subclass 


pyruvate dehydrogenase kinase 




Subclass: proprotein convertase subtilisin/kexin type 1 preproprotein; prohormone convertase 3; prohormone convertase 1; 
neuroendocr ne convertase 1 ; proprotein convertase 1 


proprotein convertase 

Subclass: proprotein convertase subtilisin/kexin type 2; subtilisin-like prohormone convertases; prohormone convertase 2; 
neuroendocrine convertase 2; KEX2-like endoprotease 2; proprotein convertase PC5 


similar to RIKEN cDNA 1 700095F04 gene product 
Alternate: unnamed protein product 


: glutathione S-transferase A1 ; GST, class alpha, 1 ; glutathione S-alkyltransferase A1 ; glutathione S-aryltransferase 
droxyalkyl)glu athione lyase A1 ; glutathione S-aralkyltransferase A1 ; GST-epsi!on; glutathione S-transferase 2 


: glutathione S-transferase A3 | 


: Glutathione S-transferase A3-3 (GST class-alpha) | 


: pyruvate dehydrogenase kinase, isoenzyme 3 [ 


: pyruvate dehydrogenase kinase, isoenzyme 2 j 


: pyruvate dehydrogenase kinase, isoenzyme 4 
: pyruvate dehydrogenase kinase, isoenzyme 1 
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Subclass: Liver carboxylesterase precursor (Acyl coenzyme A:cholesterol acyltransferase) (ACAT) (Monocyte/macrophage 
serine esterase) (HMSE) (Serine esterase 1) (Brain carboxylesterase hBr1) 


: serine esterase N-terminal truncated (503 AA) | 


: brain carboxylesterase hBM j 


: Alternate: acyl coenzyme A:cholesterol acyltransferase j 




cholinephosphotransferase 1 




Golgi vesicular membrane trafficking protein p18; Bet1 (S. cerevisiae) homolog; Betlp homolog 




acetyl-CoA synthetase 


Subclass: acetyl-CoA synthetase isoform a; cytoplasmic acetyl-coenzyme A synthetase; acetate-CoA ligase; acyl-activating 
enzyme; acetate thiokinase; acetyl-CoA synthetase 


Subclass: acetyl-CoA synthetase isoform b; cytoplasmic acetyl-coenzyme A synthetase; acetate-CoA ligase; acyl-activating 
enzyme; acetate thiokinase; acetyl-CoA synthetase 




L-kynurenine/alpha-aminoadipate aminotransferase; kynurenine aminotransferase II 


Alternate: Similar to L-kynurenine/alpha-aminoadipate aminotransferase | 


|Subclass: 


|Subclass: 


Subclass: 












R(C-HI) 
-2.03, 
F:(C-D) 
-2.36 




F:(C-HI) 
-2.03, 
F:(C-D) 
-2.15 . 




F:(C-HI) 
-2.03, 
F:(C-D) 
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NP_059075. 
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F:(C-D)-2.8 
F:(C-HI)-2.6 








































F:(C-D)-3.1 








I Subclass: alanine-glyoxylate aminotransferase 2-like 1 j 


alanine-glyoxylate aminotransferase 




I Subclass: 


| Subclass: 


| Subclass: 


| Subclass: 


j Subclass: 


| Subclass: 


| Subclass: 


| Subclass: 


| Subclass: 


I Subclass: 


| Subclass: 


| Subclass: 


| Subclass: 


| Subclass: 


| Subclass: 


| Subclass: 


| Subclass: 


| Subclass: 


tubulin 




| Subclass: squalene epoxidase 


| Subclass: squalene monooxygenase 


: tubulin, epsilon 1; epsilon-tubulin | 


; tubulin epsilon | 


: tubulin, gamma 2 | 


: Tubulin gamma-1 chain (Gamma-1 tubulin) (Gamma-tubulin complex component 1) (GCP-1). | 


: tubulin gamma | 


: tubulin beta-5 | 


: Tubulin beta-4 chain (Tubulin beta-Ill). j 


: tubulin, beta, 2 j 


: Tubulin beta-1 chain | 


: tubulin beta j 


: tubulin alpha 6 j 


: tubulin, alpha 3; tubulin, alpha, brain-specific; hum-a-tub1; hum-a-tub2 [ 


: tubulin, alpha 2 isoform 2 . | 


: tubulin, alpha 2 isoform 1 \ 


: tubulin, alpha-like 2; tubulin, alpha 8 j 


: tubulin, alpha 1; testis-specific alpha tubulin; tubulin alpha-1 chain | 


: tubulin, alpha, ubiquitous | 


: alpha tubulin | 
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NP_038906. 
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9 




















F:(C-D)-2.6 
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Subclass: 
immunog 


Subclass 


Subclass 


Subclass 


Subclass: leukocyte immunoglobulin-like receptor, subfamily A (with TM domain), member 2; leukocyte immunoglobulin-like 
receptor 7 


Subclass 
receptor ( 


jSubclass 


Subclass 


jSubclass 


Receptor 




| Subclass 


(Subclass 


Subclass 


Subclass 


Subclass 


Subclass 


Subclass 


Subclass 


calcium binding protein 


: leukocyte i 
lobulin-like i 


: leukocyte i 


: leukocyte : 


: leukocyte i 


: leukocyte 
3 


: leukocyte 


: leucocyte 


; leucocyte 


: calcium binding protein 5 j 


: calcium binding protein 4 I 


: calcium binding protein 1 isoform 2; calbrain; calcium binding protein 5; calcium binding protein 1 ; caibrain j 


: calcium binding protein 1 isoform 1; calbrain; calcium binding protein 5; calcium binding protein 1 ; calbrain j 


: calcium binding protein 1 j 


: calcium binding protein 2 isoform 2; CaBP2 | 


; calcium binding protein 2 isoform 1 ; CaBP2 | 


: calcium binding protein 2 | 


immunoglobuli 
receptor 2 


Immunoglobuli 


immunoglobuli 


immunoglobuli 


immunoglobulin-like receptor, subfamily A (with TM domain), member 1; leukocyte immunoglobulin-like 


immunoglobulin-like receptor, subfamily A \ 


immunoglobulin-like receptor-3; LIR-3 j 


immunoglobulin-like receptor | 


n-like receptor, subfamily B (with.TM and ITIM domains), member 2; leukocyte 


in-like receptor, subfamily B (with TM and ITIM domains), member 5 j 


in-like receptor, subfamily B . j 


in-like receptor, subfamily A, member 5; immunoglobulin-like transcript 10.. j 
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Subclass: leukocyte immunoglobulin-like receptor, subfamily B (with TM and ITIM domains), member 6; immunoglobulin-like 
transcript 8 


Subclass: leukocyte immunoglobulin-like receptor, subfamily B (with TM and ITIM domains), member 1 ; leukocyte 
immunoglobulin-like receptor 1; CD85 antigen 


leukocyte immunoglobulin-like receptor 1 
leukocyte immunoglobulin-like receptor-2 


leucocyte immunoglobulin-like receptor-4; LIR-4 ' j 


immunoglobulin-like transcript 5 protein, ILT5 
immunoglobulin-like transcript 6 


immunoglobulin-like transcript 7; ILT7 | 


killer cell inhibitory receptor p91 precursor \ 


monocyte inhibitory receptor precursor ) 




v-myc myelocytomatosis viral oncogene homolog; Avian myelocytomatosis viral (v-myc) oncogene homolog; C-MYC; v-myc 
avian myelocytomatosis viral oncogene homolog 




hydroxylase 


: tryptophan hydroxylase \ 


: tryptophan hydroxylase 1 (tryptophan 5-monooxygenase); tryptophan hydroxylase (tryptophan 5-monooxygenase) | 


: neuronal tryptophan hydroxylase | 


: phenylalanine hydroxylase j 


: tyrosine hydroxylase I 


: hydroxylase 2,Tyr 

: hydroxylase 3,Tyr j 




[Subclass: 
Subclass: 


| Subclass: 


Subclass: 
Subclass: 


I Subclass: 


ISubclass: 


I Subclass: 


JSubclass; 


| Subclass: 


|Subclass; 


| Subclass 


Subclass 
Subclass: 
Subclass 
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Subclass: fibroblast growth factor receptor 2 isoform 3 precursor; keratinocyte growth factor receptor; K-sam protein; protein 
tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed kinase; fibroblast growth factor receptor BEK; tyrosylproteir 
kinase; hydroxyaryl-protein kinase 


Subclass: fibroblast growth factor receptor 2 isoform 2 precursor; keratinocyte growth factor receptor; K-sam protein; protein 
tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed kinase; fibroblast growth factor receptor BEK; tyrosylproteir 
kinase; hydroxyaryl-protein kinase 


Subclass: fibroblast growth factor receptor 2 isoform 1 precursor; keratinocyte growth factor receptor; K-sam protein; protein 
tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed kinase; fibroblast growth factor receptor BEK; tyrosylproteir 
kinase; hydroxyaryl-protein kinase 


Subclass: fibroblast growth factor receptor 1 isoform 9 precursor; fms-related tyrosine kinase-2; heparin-binding growth factor 
receptor; FMS-like tyrosine kinase 2; basic fibroblast growth factor receptor 1; N-sam tyrosine kinase; FLG protein; 
protein-tyrosine kinase; tyrosylprotein kinase; hydroxyaryl-protein kinase 
Subclass: fibroblast growth factor receptor 2 


Subclass: fibroblast growth factor receptor 1 isoform 7 precursor; fms-related tyrosine kinase-2; heparin-binding growth factor 
receptor; FMS-like tyrosine kinase 2; basic fibroblast growth factor receptor 1; N-sam tyrosine kinase; FLG protein; 
protein-tyrosine kinase; tyrosylprotein kinase; hydroxyaryl-protein kinase 


Subclass: fibroblast growth factor receptor 1 isoform 5 precursor; fms-related tyrosine kinase-2; heparin-binding growth factor 
receptor; FMS-like tyrosine kinase 2; basic fibroblast growth factor receptor 1 ; N-sam tyrosine kinase; FLG protein; 
protein-tyrosine kinase; tyrosylprotein kinase; hydroxyaryl-protein kinase 

Subclass: fibroblast growth factor receptor 1 isoform 6 precursor;fms-related tyrosine kinase-2; heparin-binding growth factor 
receptor; FMS-like tyrosine kinase 2; basic fibroblast growth factor receptor 1 ; N-sam tyrosine kinase; FLG protein; 
protein-tyrosine kinase; tyrosylprotein kinase; hydroxyaryl-protein kinase 


Subclass: fibroblast growth factor receptor 1 isoform 4 precursor; fms-related tyrosine kinase-2; heparin-binding growth factor 
receptor; FMS-like tyrosine kinase 2; basic fibroblast growth factor receptor 1 ; N-sam tyrosine kinase; FLG protein; 
protein-tyrosine kinase; tyrosylprotein kinase; hydroxyaryl-protein kinase 


Subclass: fibroblast growth factor receptor 1 isoform 3 precursor; fms-related tyrosine kinase-2; heparin-binding growth factor 
receptor; FMS-like tyrosine kinase 2; basic fibroblast growth factor receptor 1 ; N-sam tyrosine kinase; FLG protein; 
protein-tyrosine kinase; tyrosylprotein kinase; hydroxyaryl-protein kinase 
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Subclass: fibroblast growth factor receptor 2 isoform 4 precursor; keratinocyte growth factor receptor; K-sam protein; protein 
tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed kinase; fibroblast growth factor receptor BEK; tyrosylproteir 
kinase; hydroxyaryl-protein kinase 


Subclass: ibrobiast growth factor receptor 2 isoform 5 precursor; keratinocyte growth factor receptor; K-sam protein; protein 
tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed kinase; fibroblast growth factor receptor BEK; tyrosylproteir 
kinase; hydroxyaryl-protein kinase 


Subclass: fibroblast growth factor receptor 2 isoform 6 precursor; keratinocyte growth factor receptor; K-sam protein; protein 
tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed kinase; fibroblast growth factor receptor BEK; tyrosylproteir 
kinase; hydroxyaryl-protein kinase 


Subclass: fibroblast growth factor receptor 2 isoform 7 precursor; keratinocyte growth factor receptor; K-sam protein; protein 
tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed kinase; fibroblast growth factor receptor BEK; tyrosylproteir 
kinase; hydroxyaryl-protein kinase 


Subclass: fibroblast growth factor receptor 2 isoform 8 precursor; keratinocyte growth factor receptor; K-sam protein; protein 
tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed kinase; fibroblast growth factor receptor BEK; tyrosylproteir 
kinase; hydroxyaryl-protein kinase 


Subclass: fibroblast growth factor receptor 2 isoform 9 precursor; keratinocyte growth factor receptor; K-sam protein; protein 
tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed kinase; fibroblast growth factor receptor BEK; tyrosylproteir 
kinase; hydroxyaryl-protein kinase 


Subclass: fibroblast growth factor receptor 2 isoform 10 precursor; keratinocyte growth factor receptor; K-sam protein; protein 
tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed kinase; fibroblast growth factor receptor BEK; tyrosylproteir 
kinase; hydroxyaryl-protein kinase 


Subclass: fibroblast growth factor receptor 2 isoform 1 1 precursor; keratinocyte growth factor receptor; K-sam protein; protein 
tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed kinase; fibroblast growth factor receptor BEK; tyrosylproteir 
kinase; hydroxyaryl-protein kinase 


Subclass: fibroblast growth factor receptor 2 isoform 12 precursor; keratinocyte growth factor receptor; K-sam protein; protein 
tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed kinase; fibroblast growth factor receptor BEK; tyrosylproteir 
kinase; hydroxyaryl-protein kinase 
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| Subclass: vascular endothelial growth factor receptor 2 j 


Subclass: vascular endothelial growth factor receptor j 


Subclass: ret proto-oncogene isoform a; RET transforming sequence; hydroxyaryl-protein kinase: cadherin family member 12; 
oncogene RET 


Subclass: ret proto-oncogene isoform b; RET transforming sequence; hydroxyaryl-protein kinase; cadherin family member 12; 
oncogene RET 


Subclass: ret proto-oncogene (multiple endocrine neoplasia MEN2A, MEN2B and medullary thyroid carcinoma 1, Hirschsprung 
disease) 


Subclass: 
oncogens 


j Subclass: 


[Subclass: 


j Subclass: 


| Subclass 


| Subclass 


| Subclass 


| Subclass 


| Subclass 


| Subclass 


Subclass: fibroblast growth factor receptor 4 isoform 2 precursor; tyrosine kinase related to fibroblast growth factor receptor; 
tyrosylprotein kinase; protein-tyrosine kinase; hydroxyaryl-protein kinase 


Subclass 
tyrosylpro 


| Subclass 


| Subclass 


jSubclass 


jSubclass: 


Subclass: fibroblast growth factor receptor 2 isoform 1 3 precursor; keratinocyte growth factor receptor; K-sam protein; protein 
tyrosine kinase, receptor like 14; FGF receptor; bacteria-expressed kinase; fibroblast growth factor receptor BEK; tyrosylproteir 
kinase; hydroxyaryl-protein kinase 


: ret proto-oncogene isoform c; RET transforming sequence; hydroxyaryl-protein kinase; cadherin family member 12; 
i RET 


: RET tyrosine kinase receptor | 


: heparin-binding growth factor receptor K-sam precursor | 


: heparin-binding growth factor receptor variant alpha-a2 • | 


: heparin-binding growth factor receptor j 


: keratinocyte growth factor receptor 2 isoform BEK j 


: keratinocyte growth factor receptor 2 isoform KGFR j 


: keratinocyte growth factor receptor 2 isoform K-sam-IIC3 J 


: keratinocyte growth factor receptor | 


: fibroblast growth factor receptor 4, soluble-form splice variant | 


: fibroblast growth factor receptor 4 isoform 1 precursor; tyrosine kinase related to fibroblast growth factor receptor; 
tein kinase; protein-tyrosine kinase; hydroxyaryl-protein kinase 


: fibroblast growth factor 4 j 


: fibroblast growth factor receptor 3 isoform 2 precursor; hydroxyaryl-protein kinase; tyrosine kinase JTK4 j 


: fibroblast growth factor receptor 3 isoform 1 precursor; hydroxyaryl-protein kinase; tyrosine kinase JTK4 j 


: fibroblast growth factor receptor 3 | 



ir.HUo/roozsn/ADd 



9I1-Z60/1-00Z O/W 



WO 2004/092416 



PCT/US2004/010I9I 



.E '5 
S o 



II 
S g 



o 

CM 



M CO W CO 

C d C d 

cr a c ct 

o o o o 



a a a L 

(Q (Q (Q (D 

1' §' §■ ~° 

til Q) Q) 2 

3 a a 2 

a. a. a. ^ 



o o o o. 

H T3 U 5j' 

o o o 5" 



a. Ci. Q- 3 

O O O Q- 

3 3 3 3 

t» CM Ol 3 



CD CO CO o 

o o o 2- 
o o o <L 



•o tj -a q. 

5 CD CD O 

■a T3 "o 3 

o 3 3 2>. 

r> -a -o 3 

3 3 3 ro 

CD CD CD ~S 

3' 3' z>" 



3 ZJ Z) 
CD CD CD 
CO CQ CO 



=3 3 3 



o o o 

"3 T3 "O 
3 3 3 



3 □ D 

ro ro 
oo oo oo 



o o o 



"a -a -o 



TD T3 T3 



3 3 3 



ir>ioio/t-(Mi<:sn/x3d 



!>It-Z60/f00Z OAV 



WO 2004/092416 



PCT/US2004/0I019I 



5 5 t= -g 



5 ^ Q. ro 
o o o o 



CD 0) 

.9 cc 



^ n n £i 

3 3 Z> 3 

CO CO CO CO 



CD 


CO 


lata: 


lata 


Q. 


CL 
CO 


O 


o 
.c 




Q. 

CO 


i5 


lo 


(b 




ctos 


CtO! 


S 


E 


CO 


o" 


0) 


CD 


ro 


CO 


o 


o 


< 


< 

b" 


.2 
ro 


ro 


.c 

Q_ 


_c 

CL 
CO 


CO 

o 


o 

-SZ 


x: 

Q. 


Q- 




15 


tost 


:t0Sl 


o 


fruc 


cri 


o 
a> 


asi 


ro 


o 

T3 


"6 


ro 


33 


;lass: 


:lass: 


-Q 


-Q 


CO 


=3 

CO 



o "c 



_ro 

£ -2 
co 42 



to 

CXI 



63 ^1 



03 W 

> > 



O. O ~0 O. O. 



a. ^ 

i E 



S» CD 

o o 

5 §■ 



2. CD 
T3_ 
o" 

3 i 



IrtlOIO/tMMIZSflALDd 



yircr.o/t-ooz om. 



WO 2004/0!>24ir> 



PCT/US2004/010191 



£ o & 



.g » j= 



'5 O S 

O Q. O g> Q. 



o o ro 



^ ^ ^ 5 6 

X m " n 



1 £ ~ * 8 E 



S to H 
I 8 I ? 



0.0.0.0.0 



W CO 00 



J5J5.S9J9J5J2 x J5 x _ro _ro 
ooooioo gogoo 

OT W CO (O CO (fl O CO u CO CO 



69 



Cfl ui cn 3 O) 5' 

B- g- 8- -$ |f 



zj' =>' a' n n <D 



2 2 2 3 



CD S- 

3 *3 



™ -a 

CD O 
O -i 



3 3 
o o - 
> - 



CD CD 
CO CO 

c c 


CD 
CQ 

c 


fact< 
regi 


CD 03 


CD 


CO -> 


o o 
^ «5 


O 
>2 


o 
•3 


acto 
acto 


acto 


acto 


cn cn 


cn 


*» 


o 9, 




55 

TI 


LUJC 






cr CD 




i 






X5 

=r 






o 






CD 






T3 
CD 






O. 

o" 






5" 






CD 

3, 






3 






CD 

CQ 



1 1 



° I 

3 i 

05 - • 

° I 

CD W 



ir»ioio/t-(xizsri/x3d 



WO 2004/092416 PCT/US2004/010191 



stero!-C4-methy! oxidase-like; C-4 methyl sterol 
condensing enzyme 


Subclass: acyl-malonyl condensing enzyme J 


Subclass: hypothetical protein FLJ40 154 


B-cell translocation gene 3; abundant in neuroepithelium area 


cytochrome P450 


Subclass: cytochrome P450, subfamily I (aromatic compound-inducible), polypeptide 2; dioxin-inducible P3-450; P450 form 4; 
microsomal monooxygenase; xenobiotic monooxygenase; aryl hydrocarbon hydroxylase; flavoprotein-linked monooxvaenase 


Subclass: cytochrome P450, subfamily I (aromatic compound-inducible), polypeptide 1; flavoprotein-linked monooxygenase; 
cytochrome P1-450, dioxin-inducible; aryl hydrocarbon hydroxylase; P450 form 6; xenobiotic monooxygenase; microsomal 
monooxygenase " 


Subclass: cytochrome P450 CYP1B1 




suppressor of cytokine signalmg-2; STAT induced STAT inhibitor-2; cytokine-inducible SH2 protein 2 




tumor necrosis factor, alpha-induced protein 2 
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NM.025436 

NP_079712. 
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NM_008509 
NP_032535.1 






AK004138 
BAB23 187.1 




NM_011802 
NP 035932.1 




NM_008322 
NP_032348.1 




AK011896 
BAB27902.1 








NM_011300 
NP_035430.1 


NM_008303 
NP_032329.1 








NM_025754 
NP_080030.1 
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lipoprotein lipase 


I Alternative: putative p1 50 j 


| Alternative: hypothetical protein FLJ20048 j 


ORFII 




CIpX caseinolytic protease X homolog; energy-dependent regulator of proteolysis; CIpX (caseinolytic protease X, E. coli) 




isocitrate dehydrogenase 2 (NADP+), mitochondrial; Isocitrate dehydrogenase, mitochondrial 




unnamed protein product 




| Subclass: similar to ribosomal protein S7 | 


aldolase 

Subclass: Aldolase A; fructose-bisphosphate aldolase; Aldolase A, fructose-bisphosphatase 
Subclass: aldolase C, fructose-bisphosphate; Aldolase C, fructose-bisphosphatase 

heat shock 10kDa protein 1 (chaperonin 10); heat shock 10kD protein 1 (chaperonin 10) 

ribosomal protein 

Subclass: ribosomal protein S7; 40S ribosomal protein S7 
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type I hair keratin 3A; Ha-3I; hard keratin, type 1,31; keratin, hair, acidic,3A | 


•I 
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1 
1 
'S 

m 
g 
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.a 
Jq 

i 


: Keratin, type I cuticular HA4 (Hair keratin, type I HA4). j 


: type I hair keratin 5; Ha-5; hard keratin, type I, 5 j 


: type I hair keratin 6 j 


: type I hair keratin 7 i 


: Keratin, type I cuticular HAS (Hair keratin, type I HAS). j 


: type I intermediate filament cytokeratin ] 




cell death activator CIDE-A 


| Alternate: Similar to cell death- inducing DFFA-like effector a ) 


BRCAl associated protein 


|Subclass: BRCAl-associated protein 2 ] 


(Subclass: putative DDB pi 27-associated protein j 




presynaptic cytomatrix protein 


| Subclass: Piccolo protein (Aczonin). ] 


| Alternate: Zinc finger protein | 




Glycoprotein 


| Subclass: Orosomucoid-l (alpha-l-acid glycoprotein-1); alpha-l-acid glycoprotein 1 | 


| Subclass: orosomucoid 2; alpha-l-acid glycoprotein, type 2 | 




T.aminin 


Subclass: Laminin beta chain j 


Subclass: 


Subclass: 


Subclass: 


Subclass: 


Subclass; 


Subclass: 


| Subclass: 


| Subclass: 
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NMJ07702 
NP 031728.1 




AK013885 
NP 082503.1 








NM_011995 
NP 036125.2 








NM_013623 
NP 038651.1 








NM_008484 
NP 032510.1 
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: Similar to carboxypeptidase Z | 


: similar to Potential carboxypeptidase-like protein X2 precursor 


: Potential carboxypeptidase-like protein X2 precursor f 




inositol monophosphatase 


: inositol(myo)-l(or4)-inonophosphatase 1 


: inositol(myo)-l(or 4)-mouophosphatase 2 


: brain myo-inositol monophosphatase A2b; EVIPase A2b | 


: Chain A, Human Inositol Monophosphatase (E.C.3. 1 .3.25) Duner Complex With Gadolinium And Sulfate 1 


Hemoglobin 


ISubclass: hemoglobin alpha- 1 globin chain 


Subclass: hemoglobin alpha-2 j 




Cyytochrome P450 


Subclass: cytochrome P450, family 39, subfamily A, polypeptide 1; oxysterol 7alpha-hydroxylase; cytochrome P450, subfamily XXXIX 
(oxysterol 7 alpha-hydroxylase), polypeptide 1 




Exocyst complex component 


Subclass: Exocyst complex component Sec8 




Multidrug resistance-associated protein (Multi-specific organic anion tranporter; ATP-binding cassette 


Multidrug resistance-associated protein 5 (Multi-specific organic anion tranporter-C) (MOAT-O (dABCI 1 ) CSMP P^ 1 


ATP-binding cassette, sub-family C, member 5; canalicular multispecific organic anion transporter C I 


ATP-binding cassette protein CI 1 " T * ""j 


1 Subclass 


I Subclass 


1 Subclass 


Subclass 


Subclass 


| Subclass 


Subclass 


Subclass: 


Subclass: 


Subclass: 
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NP 444491.1 
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NP_032244.1 








NM_018887 
NP 061375.1 






NM_009148 
NP 033174.1 






NM_013790 
NP 038818.1 
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Subclass: ATP-binding cassette, subfamily B, member 4 isoform C; P glycoprotein 3/mulriple drug resistance 3; P-glycoprotein-3/multiple 
drug resistance-3; multiple drug resistance 3 


Subclass: ATP-binding cassette, sub-family C, member 6; anthracycline resistance-associated [ 


Subclass: cystic fibrosis transmembrane conductance regulator, ATP-binding cassette (sub-family C, member 7); cystic fibrosis transmembrane 
conductance regulator; ATP-binding cassette, sub-family C member 7; CFTR/MRP 


ATP-binding cassette, sub-family C, member 8; Sulfonylurea receptor; sulfonylurea receptor (hj^erinsulinemia) j 


ATP-binding cassette, sub-family C, member 9 isoform SUR2A-delta-14; sulfonylurea receptor 2A | 


ATP-binding cassette, sub-family C, member 10; multidrug resistance-associated protein 7 1 


ATP-binding cassette, sub-family B (MDR/TAP), member 1; P glycoprotein 1/multiple drug resistance 1; P-glycoprotein-l/multiple 
stance- 1; multidrug resistance 1 


| Subclass: Cystic fibrosis transmembrane conductance regulator (CFTR) (cAMP-dependent chloride channel). | 




Membrane glycoprotein 


| Subclass: Cell-surface receptor | 


Subclass: Tumor-associated calcium signal transducer 1 precursor (Major gastrointestinal tumor-associated protein GA733-2) (Epithelial cell 
surface antigen) (Epithelial glycoprotein) (EGP) (Adenocarcinoma-associated antigen) (KSA) (KS 1/4 antigen) (Cell surface glycoprotein 
Trop-1). 


Subclass: tumor-associated calcium signal transducer 1 precursor; membrane component, cliromosome 4, surface marker (35kD glycoprotein); 
MK-1 antigen; antigen identified by monoclonal antibody AUAl 


Subclass: tumor-associated calcium signal transducer 2 precursor; membrane component, chromosome 1, surface marker 1 (40kD glycoprotein 
identified by monoclonal antibody GA733); epithelial glycoprotein- 1 


Subclass: Tumor-associated calcium signal transducer 2 precursor (Pancreatic carcinoma marker protein GA733-1) (Cell surface glycoprotein 
Trop-2). 




Regulator of neurotransmitter release 


: Synapsins ] 


: Synapsin III | 


: synapsin III isoform IUa | 


Subclass: 


Subclass: 


| Subclass: 


Subclass: 
drug resi: 


| Subclass 


| Subclass 


| Subclass 
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Keratin 


cytokeratin 9 

Keratin, type I cytoskeletal 10 (Cytokeratin 10) (KIO) (CK 10). 
Keratin- 12 

kerarin 13, type I, cytoskeletal 
keratin 14, type I, cytoskeletal 

kerarin 15; keratin- 15, basic; keratin- 15, beta; type I cytoskeletal 15; cytokeratin 15 
keratin 1 6 , type I, cytoskeletal 
keratin 17 
keratin 18 

keratin 19; keratin, type I cytoskeletal 19; keratin, type 1, 40-kd; cytokeratin 19; 40-kDa keratin intermediate filament precursor gen< 
. keratin 20; cytokeratin 20; keratin, type I cytoskeletal 20 
; Keratin, type I cytoskeletal 23 (Cytokeratin 23) (K23) (CK 23). 
: keratin 24 

: type I hair keratin 1; Ha-1; hard keratin, type 1, 1; keratin, hair, acidic, 1 
: type I hair keratin 2; Ha-2; hard keratin, type I, 2; keratin, hair, acidic,2 
: Keratin, type I cuticular HA3-I (Hair keratin, type I HA3-I). 
: type I hair keratin 3A; Ha-3I; hard keratin, type 1,31; keratin, hair, acidic,3A 


: type I hair keratin 3B; Ha-3I1; hard keratin, type 1, 311; keratin, hair, acidic,3B 

• hair ketarin acidic 3-11 _____ 


: Keratin, type I cuticular HA4 (Hair keratin, type I HA4). 


: type I hair keratin 6 I 




1 
.b 

1 




■B 


1 Subclass: galectiu 4; lectin galactoside-binding soluble 4 1 


Subclass: 
Subclass: 
Subclass: 
Subclass: 
Subclass: 
Subclass: 
Subclass: 
Subclass: 
Subclass: 
Subclass: 
Subclass; 
Subclass: 
Subclass: 
Subclass 
Subclass 
Subclass 
Subclass 


Subclass 
Subclass 


Subclass 
Subclass 


1 Subclass 


1 Subclass 


I Subclass 
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cytochrome P450 


: cytochrome P450, family 2 j 


: cytochrome P450, family 2, subfamily C | 


: cytochrome P450, family 2, subfamily C, polypeptide 18; cytochrome P450, subfamily IIC (mephenytoin 4-bydroxylase), 
ide 17; cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), polypeptide 18; microsomal monooxygenase; 
ein-linked monooxygenase 


Subclass: cytochrome P450, family 2, subfamily C, polypeptide 19; cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), 
polypeptide 19; mephenytoin 4-hydroxylase; microsomal monooxygenase; xenobiotic monooxygenase; flavoprotein-linked monooxygenase 


| Subclass: cytochrome P450 arachidonic acid epoxygenase isoform, Cyp 2C8 [human, kidney, Peptide Partial, 485 aa]. j 


Subclass: cytochrome P450, family 2, subfamily C, polypeptide 8 isoform 1; cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), 
polypeptide 8; mephenytoin 4-hydroxylase; microsomal monooxygenase; xenobiotic monooxygenase; flavoprotem-linked monooxygenase; 
P450 form 1 


Subclass: cytochrome P450, family 2, subfamily C, polypeptide 8 isoform 2; cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), 
polypeptide 8; mephenytoin 4-hydroxylase; microsomal monooxygenase; xenobiotic monooxygenase; flavoprotein-linked monooxygenase; 
P450 form 1 


cytochrome P450, family 2, subfamily C, polj'peptide 9; cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), polypeptide 
tenytoin 4-hydroxylase; microsomal monooxygenase; xenobiotic monooxygenase; flavoprotein-linked monooxygenase; cytochrome 
Dfamily IIC (mephenytoin 4-hydroxylase), polypeptide 9 

Cytochrome P450 2C10 (CYPIICIO) (P450 MP-8) (S-mephenytom 4-hydroxylase) (P-450MP). 


cytochrome P450, family 2, subfamily A - j 


cytochrome P450, family 2, subfamily A, polypeptide 6; coumarin 7-hydroxylase; cytochrome P450, subfamily IIA 
rbital-inducible), polypeptide 3; cytochrome P450, subfamily IIA (phenobarbital-inducible), polypeptide 6; xenobiotic 
genase; flavoprotein-linked monooxygenase 

cytochrome P450, family 2, subfamily A, polypeptide 7 isoform 1 ; cytochrome P450, subfamily IIA (phenobarbital-inducible), 
de7 


Subclass: cytochrome P450, family 2, subfamily A, polypeptide 13; cytochrome P450, subfamily DA (phenobarbital-inducible), polypeptide 1] 


Subclass: cytochrome P450, family 2, subfamily B j 


Subclass: ytochrome P450, family 2, subfamily B, polypeptide 6; cytochrome P450, subfamily IB (phenobarbital-inducible) polypeptide 6 [ 


(Subclass: 


| Subclass: 


Subclass 
polypept 
flavoprol 


Subclass: 
10; mepb 
P450, sul 
Subclass: 


j Subclass; 


Subclass: 
(phenoba 
monooxy 
Subclass: 
polypepti 
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Subclass: cytochrome P450, subfamily IID j 


Subclass: cytochrome P450, subfamily IID, polypeptide 6; debrisoquine 4-hydroxylase; microsomal monooxygenase; xenobiotic 
monooxygenase; 

flavoprotem-linked monooxygenase; cytochrome P450, subfamily IID (debrisoquine, sparteine, etc., -metabolizing)-like 1 


| Subclass: cytochrome P450, family 2, subfamily E | 


Subclass: cytochrome P450, family 2, subfamily E, polypeptide 1; cytochrome P450, subfamily HE (ethanol-inducible), polypeptide 1; 
microsomal monooxygenase; xenobiotic monooxygenase; flavoprotein-linked monooxygenase; cytochrome P450, subfamily HE 
(ethanol-inducible) 

Subclass: cytochrome P450, family 2, subfamily F 

Subclass: cytochrome P450, family 2, subfamily F, polypeptide 1; cytochrome P450, subfamily HP, polypeptide 1; microsomal 
monooxygenase; xenobiotic monooxygenase; flavoprotein-linked monooxygenase 
Subclass: cytochrome P450, family 2, subfamily J 


Subclass: cytochrome P450, family 2, subfamily J, polypeptide 2; cytochrome P450, subfamily IIJ (arachidonic acid epoxygenase) polypeptide 
2; microsomal monooxygenase; flavoprotein-linked monooxygenase 


| Subclass: cytochrome P450, family 2, subfamily S ] 


cytochrome P450, family 2, subfamily S, polypeptide 1; cytochrome P450 family member predicted from ESTs; cytochrome P540, 
f US, polypeptide 1; cytochrome P450, subfamily IIS, polypeptide 1 
cytochrome P450, family 1 


cytochrome P450, family 1, subfamily A j 


cytochrome P450, family 1, subfamily A, polypeptide 1; aryl hydrocarbon hydroxylase; cytochrome P450, subfamily I (aromatic 
d-inducible), polypeptide 1; flavoprotein-linked monooxygenase; cytochrome PI -450, dioxin-inducible; P450 form 6; xenobiotic 
genase; microsomal monooxygenase - 


| Subclass: cytochrome P450, family 1, subfamily B j 


Subclass: cytochrome P450, family 1, subfamily B, polypeptide I; aryl hydrocarbon hydroxylase; cytochrome P450, subfamily I 
(dioxin-inducible), polypeptide 1 (glaucoma 3, primary infantile); microsomal monooxygenase; xenobiotic monooxygenase; flavoprotein-Iinkec 
monooxygenase 




ABC-transporter; peptide transporter; ATP -binding cassette 


| Subclass: ATP-bindiiig cassette, sub-family B, member 9 isoform 1 | 


Subclass: 
subfamil; 
Subclass: 


| Subclass: 


Subclass: 
compoun 
monooxy 
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protein up-regulated by BCG-CWS 


unnamed protein product 
KIAA0062 


similar to KIAA0062 | 


Unknown (protein for MGC:23235) | 


similar to PP3898 


XAB2 | 


KIAA1 177 protein | 


HCNP protein; XPA-binding protein 2 I 
Similar to HCNP protein; XPA-binding protein 2 | 


FLJ00081 protein | 


nocturnin 


|Alternate: CCR4 carbon catabolite repression 4-like (S. cerevisiae); CCR4-like (carbon catabolite repression 4, S. cerevisiae) | 


guanine nucleotide binding protein (G protein) 


: guanine nucleotide binding protein (G protein), alpha 14; guanine nucleotide-binding protein 14 J 


: GTP-binding protein alpha q | 


: Guanine nucleotide-binding protein G(Y), alpha subunit (Alpha-1 1 ) | 


: guanine nucleotide binding protein (G protein), alpha 15 (Gq class) | 


similar to GNA1 5; ALPHA-1 6 


similar to Guanine nucleotide-binding protein, alpha-1 1 subunit (GL2) I 


Guanine nucleotide-binding protein G(O), alpha subunit 2 j 


Alternate: 
Alternate: 


| Alternate: 


| Alternate: 


| Alternate: 


| Alternate: 


Alternate: 
Alternate: 


(Alternate: 


|Subclass: 


(Subclass: 


(Subclass; 


(Subclass: 


Subclass: 


iSubclass: 


Subclass: 
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4.88 
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3.75 
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3.08, 
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NP 080504.1 
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NP_032163.1 
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^as-related protein Rab-30 


protein 


Arylsulfatase A; Chain: P; Synonym: Cerebroside-3-Sulfate-Sultatase; tc. 6. i .o.o . 

Similar to arylsulfatase A . ... 

M ^n^-tn^,,.!^ precursor. Galactosamine (N-acetyl)-6-sulfate sulfatase; chondro.ttnase 


Unknown (protein for MGC:24090) 
arylsulfatase F 
similar to arylsulfatase F 
: Unknown (protein for MGC:31 932) 

: arylsulfatase D precursor, isoform a — _ _ 


cytochrome P450, subfamily IID, polypeptide 6; debrisoquine 4-hydroxylase; microsomal monooxygenase, xenob.otic 
monooxygenase; flavoprotein-linked monooxygenase _ 


general transcription factor IIF, polypeptide 1 (74kD subunit) 


Alternate: Transoripticn Initiation Factor lif, Subunit; Chain: A, C, E, G; Fragment: Residues 2-119; Synonym: Transection 
Initiation FactorRap30 , — — 1 


KIAA1879 protein 


growth arrest-specific 6; AXL stimulatory factor 

Alternate: protein S (alpha); Protein S, alpha . ■ 1 


KIAA1001 
Alternate - 


Alternate: 
Alternate: 


Alternate: 
Alternate 
Alternate 
Alternate 
Alternate 


S 

& a 


J:(C-D) ! 
2.44 












U:(C-D) 
2.38 


U:(C-D) 
2.21 




O O) I (c 
±> cn ±> 2 


U:(C-D) 
2.14 


AK017185 I 


AK018132 
BAB31086.1 












AK004984 
BAB23719.1 


AKO 13002 
BAB28588.1 




AK007293 
BAB24937.1 


NM_0 19521 
NP 062394.1 
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dJ1024N4.1 (novel Sodium:solute symporter family member similar to SLC5A1 (SGLT1)) 

sodium/myo-inositol cotransporter 2; putative sodium-coupled cotransporter RKST1 ; homolog of rabbit KST1 

putative sodium-coupled cotransporter RKST1 

simiiarilar to 597 aa protein related to Na/glucose cotransporters 

Sodium/myo-inositol cotransporter (Na(+)/myo-inositol cotransporter) 


hypothetical protein FLJ25217 ! 




transmembrane 9 superfamily 


Subclass: transmembrane 9 superfamily member 1; multispanning membrane protein (70kD); transmembrane protein 9 
superfamily member 1 


| Subclass: transmembrane 9 superfamily member 2; 76 kDa membrane protein; transmembrane protein 9 superfamily member 2 | 


| Subclass: similar to Transmembrane 9 superfamily protein member 3 precursor (SM-1 1044 binding protein) (EP70-P-iso) j 


| Subclass: transmembrane protein TM9SF3 \ 


j Alternate: SM-1 1044 binding protein j 


jAlternate: KIAA0255 gene product | 


|Alternate: endomembrane protein emp70 precursor isolog j 


|Alternate: unnamed protein product | 


jAlternate: unnamed protein product | 




proteinase inhibitor 


: alphal -antichymotrypsin .... j 


: similar to Alpha-1 -antichymotrypsin precursor (ACT) 

: serine (or cysteine) proteinase inhibitor, clade A (alpha-1 antiproteinase, antitrypsin), member 3 

: chymotrypsin inhibitor 

: Cleaved Antichymotrypsin A347R 

: Cleaved Antichymotrypsin A349R 

: Cleaved Antichymotrypsin T345R 


Subclass: 
Subclass: 
Subclass: 
Subclass: 
Subclass: 


| Alternate: 


| Subclass: 


Subclass 
Subclass 
Subclass 
Subclass: 
Subclass: 
Subclass: 
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crystallin, alpha B; crystallin, alpha-2; Rosenthal fiber component; heat-shock 20 kD like-protein 




integrin alpha 


VLA-3 alpha subunit j 


ntegrin alpha-3 (Galactoprotein B3) (GAPB3) (VLA-3 alpha chain) (CD49c) 1 


ntegrin alpha 3 isoform b 1 


ntegrin alpha 6 1 


ntegrin alpha-6 chain precursor, splice form A I 


ntegrin alpha-6 chain precursor, splice form B | 


ntegrin alpha 7 j 




transmembrane protein claudin 5; androgen withdrawal and apoptosis induced protein RVP1 (rat)-like; Claudin-5 (transmembrane 
protein deleted in velocardiofacial syndrome) 




DC-specific transmembrane protein 


chromosome 11 open reading frame 24 




inducible T-cell co-stimulator; activation-inducible lymphocyte immunomediatory molecule; inducible costimulator 
Alternate: Similar to inducible T-cell co-stimulator 


heat shock 70kDa protein 














Subclass: 


Subclass: 


Subclass 


Subclass 


Subclass 


Subclass 


Subclass 
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D)2.05 
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D) 2.04 




U:(HI- 
D)2.01 


U:(C-HI 
)+3.19 

U:(C-D) 
+2.42 




U:(C-HI 
)6.6 




U:(C-HI 
)3.58 
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NP 034094.1 
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NM_013805 
NP 038833.1 




AK014697 
BAB29508.1 


AK007868 
BAB25319.1 




NM_0 17480 
NP_059508.1 




M12571 
AAA57234.1 
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annexin VII isoform 1 ; annexin VII (synexin); synexin I 


annexin A1 3 isoform b I 


annexin A13; annexin XIII; annexin, intestine-specific I 


annexin 3 1 ; annexin XXXI I 


keratinocyte annexin-like protein I 


protein PP4-X I 


protein p68 (1 -673) I 




intestinal fatty acid binding protein 2; Fatty acid-binding protein, intestinal; l-FABP; fatty acid binding protein 2, intestinal 




cytochrome P450 


Subclass: cytochrome P450, subfamily XVII polypeptide; steroid 17-alpha-monooxygenase; steroid 17-alpha-hydroxylase/17,20 
lyase; cytochrome p450 XVI I A1 


Subclass: cytochrome P450, subfamily I (aromatic compound-inducible), polypeptide 1 ; flavoprotein-linked monooxygenase; 
cytochrome P1-450, dioxin-inducible; aryl hydrocarbon hydroxylase; P450 form 6; xenobiotic monooxygenase; microsomal 
monooxvqenase 


: cytochrome P450-1A2 I 


: cytochrome P450 4 I 


: Cytochrome P450 XXIB (Steroid 21 -hydroxylase) (P450-C21 B) j 


: cytochrome P450, subfamily XXIA polypeptide 2; steroid 21 -monooxygenase; steroid 21 -hydroxylase j 


: cytochrome P450CYP1B1 I 




Subclass: 


Subclass: 


Subclass: 


Subclass: 


Subclass: 


Alternate: 


Alternate: 


Subclass 


| Subclass 


I Subclass 


Subclass 


Subclass 
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chromosome 1 1 open reading frame 24 




Regulator of G-protein signaling 16 (RGS16) (Retinally abundant regulator of G-protem signaling) (RGS-R) (A28-RGS14P) 




aminolevulinate synthase 


aminolevulinate synthase 1 I 


5-aminolevulinic acid synthase I 


5-aminolevulinic acid synthase, erythroid-specific, mitochondrial precursor (Delta-aminolevulinate synthase) (Delta-ALA 
e) (ALAS-E) 


ISubclass: aminolevulinate, delta-, synthase 2; Aminolevulinate, delta-, synthase-2 I 


i Subclass: Similar to aminolevulinate, delta-, synthase 2 (sideroblastic/hypochromic anemia) I 




Aldehvde dehydrogenase 


1 Subclass: similar to fatty aldehyde dehydrogenase 1 


Subclass: aldehyde dehydrogenase 3A2; aldehyde dehydrogenase 10; aldehyde dehydrogenase 3 family, member A2; fatty 
aldehyde dehydroaenase 


Subclass: aldehyde dehydrogenase 3 family, member A1 ; aldehyde dehydrogenase, dimeric NADP-preferring; acetaldehyde 
dehydrogenase; ALDH, stomach type 


: aldehyde dehydrogenase 3B1; aldehyde dehydrogenase 7; aldehyde dehydrogenase 3 family, member B1 j 


: Similar to aldehyde dehydrogenase 3 family, member B1 

: aldehyde dehydrogenase 3B2; aldehyde dehydrogenase 8; aldehyde dehydrogenase 3 family, member B2 


: Similar to aldehyde dehydrogenase 3 family, member B2 I 




[Subclass: 


I Subclass: 


Subclass: 
synthetas 


ISubclass 


Subclass 
Subclass 


1 Subclass 
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carnitine acetyltransferase 


Carnitine O-acetyltransferase (Carnitine acetylase) (CAT) I 


carnitine acetyltransferase isoform 1 j 


carnitine acetyltransferase isoform 2 ] 


carnitine acetyltransferase isoform 3 I 




X-ray repair cross complementing protein 2; X-ray repair, complementing defective, repair in Chinese hamster; DNA repair protein 
XRCC2 


elongation of very long chain fatty acids (FEN1/Elo2, SUR4/Elo3, yeast)-like 2 


Alternate: elongation of very long chain fatty acids (FEN1/Elo2, SUR4/Elo3, yeast)-like 4; Stargardt disease 3 (autosomal 
dominant) 




CD3Z antigen, zeta polypeptide (TiT3 complex) 




fatty acid desaturase 


: fatty acid desaturase 2; delta-6 fatty acid desaturase; linoleoyl-CoA desaturase (delta-6-desaturase)-like 2 | 


: fatty acid desaturase 3; delta-6 fatty acid desaturase; linoleoyl-CoA desaturase. (delta-6-desaturase)-like 3 | 


: fatty acid desaturase 1; linoleoyl-CoA desaturase (delta-6-desaturase)-like 1; delta-5 desaturase; delta-5 fatty acid 
se 




Subclass: 


Subclass: 


Subclass: 


Subclass: 


Subclass 


j Subclass 


Subclass 
desatura; 
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thioredoxin interacting protein; upregulated by 1,25-dihydroxyvitamin D-3 




DnaJ (Hsp40) homolog, subfamily B, member 9; microvascular endothelial differentiation gene 1; DKFZP564F1862 protein; 
endoplasmic reticulum DnaJ homolog 4 


| Alternate: similar to putative microvascular endothelial differentiation gene 1 ; similar to X98993 (PID:g1 771 560) I 


Kruppel-like factor 15; KKLF protein; kidney-enriched Kruppel-like factor 
Zinc finger protein 


: zinc finger protein 93 homolog; zinc finger protein homologous to mouse Zfp93; zinc finger protein homologous to Zfp9C 
I zinc finger protein 93 homolog (mouse) 


: zinc finger protein 226; Kruppel-associated box protein | 


: Zinc finger protein ZNF45 j 


: similar to Zinc finger protein 229 1 


: zinc finger protein 224 | 


: zinc finger protein 228 | 


: similar to ZNF228 protein j 


: Zinc finger protein 234 (Zinc finger protein HZF4) j 


: similar to Zinc finger protein 234 (Zinc finger protein HZF4) | 


: zinc finger protein 225 | 




similar to RIKEN cDNA 1810054013 




Subclass: 
in mouse: 


| Subclass: 


| Subclass: 


(Subclass: 


| Subclass: 


jSubclass 


jSubclass: 


|Subclass: 


[Subclass: 


Subclass: 
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hydroxyacid oxidase 


; medium-chain 2-hydroxy acid oxidase HAOX3; (S)-2-hydroxy-acid oxidase; glycolate oxidase | 


; long-chain L-2-hydroxy acid oxidase; (S)-2-hydroxy-acid oxidase; glycolate oxidase | 


; (S)-2-hydroxy-acid oxidase; glycolate oxidase i 




platelet-derived growth factor receptor 


: platelet-derived growth factor receptor alpha polypeptide [ 


: platelet-derived growth factor receptor beta; beta platelet-derived growth factor receptor | 


vascular endothelial growth factor receptor | 


: Vascular endothelial growth factor receptor 3 (VEGFR-3) (Tyrosine-protein kinase receptor FLT4) j 


: vascular endothelial growth factor receptor 2 | 


KIT protein j 


colony stimulating factor 1 receptor, formerly McDonough feline sarcoma viral (v-fms) oncogene homolog | 


Macrophage colony stimulating factor I receptor (CSF-1-R) (Fms proto-oncogene) (c-fms) (CD1 15 antigen) j 


FLT3 receptor tyrosine kinase | 


fms-related tyrosine kinase 3 I 


fms-related tyrosine kinase 1 (vascular endothelial growth factor/vascular permeability factor receptor) j 


fms-related tyrosine kinase 4; fms-related tyrosine kinase-4 (vascular endothelial growth factor receptor 3) ! 




inhibin beta C chain preproprotein; activin beta-C chain 


[Alternate: activin beta E j 


ATP-binding cassette, sub-family D 


ATP-binding cassette, sub-family D, member 2; adrenoleukodystrophy-like 1 ; hALDR | 


ATP-binding cassette, sub-family D (ALD), member 1 ; adrenoleukodystrophy protein j 


ATP-binding cassette, sub-family D, member 3; Peroxisomal membrane protein-1 (70kD); peroxisomal membrane 
(70kD, Zellweger syndrome); peroxisomal membrane protein-1 


: hydroxyacid oxidase 3; 


: hydroxyacid oxidase 2; 


: hydroxyacid oxidase 1 ; 


Subclass: 


Subclass: 


Subclass: 


Subclass: 


jSubclass: 


| Alternate: 


jSubclass: 


Subclass: 


(Alternate: 


| Alternate: 


I Alternate: 


| Alternate: 


Alternate: 


Alternate: 


[Alternate: 


Subclass: 


Subclass: 


Subclass: 
protein 1 
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SWI/SNF-related matrix-associated actin-dependent regulator of chromatin a-like 1; HepA-related protein; SMARCA-like protein 1 




Similar to RIKEN cDNA 1700018018 gene 




dynein, cytoplasmic, light polypeptide; Dynein, cytoplasmic, light chain (protein inhibitor of neuronal NOS); protein inhibitor of 
neuronal nitric oxide synthase 


sema domain, seven thrombospondin repeats (type 1 and type 1-like), transmembrane domain (TM) and short cytoplasmic 
domain, (semaphorin) 5A; semaphorin F; sema domain, seven thrombospondin repeats (type 1 and type 1-like), transmembrane 
domain (TM) and short cytoplasmic domain, 5A 


| Alternate: KIAA1 445 protein J 


| Alternate: similar to KIAA1445 protein j 




hypothetical protein MGC2605 


|Alternate: similar to hydroxyacyl glutathione hydrolase 2 j 




TBP-associated factor 6 Subclass: 


Subclass: TBP-associated factor 6 isoform gamma; TAF6 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 
80 kD; TATA box binding protein (TBP)-associated factor, RNA polymerase II, E, 70/85kD; transcription initiation factor TFIID 70 
kD subunit . 


Subclass: TBP-associated factor 6 isoform delta; TAF6 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 80 
kD; TATA box binding protein (TBP)-associated factor, RNA polymerase II, E, 70/85kD; transcription initiation factor TFIID 70 kD 
subunit 
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serine/threonine protein kinase sgk (serum/glucocorticoid regulated kinase) 


Alternate: serum/glucocorticoid regulated kinase-like; cytokine-independent survival kinase ] 


Alternate: v-akt murine thymoma viral oncogene homolog 3 (protein kinase B, gamma); protein kinase B j 




monoglyceride lipase 




solute carrier family 1 (glutamate/neutral amino acid transporter) 


Subclass: solute carrier family 1 (glutamate/neutral amino acid transporter), member 4; Solute carrier family 1 (glutamate/neutral 
amino acid transporter), v 


Subclass: solute carrier family 1 (neutral amino acid transporter), member 5; baboon M7 virus receptor; RD114 virus receptor; 
neutral amino acid transporter B 


Alternate: sodium-dependent neutral amino acid transporter type 2 truncated isoform \ 


| Alternate: neutral amino acid transporter. B | 




membrane-associated guanylate kinase-related 3 


: similar to membrane-associated guanylate kinase MAGI3 j 


: MAGMA 


3 

CD 

X) 
O 

(5 
< 


:MAGI-1B alpha beta j 


: dJ730K3.2 (similar to BAH -associated protein) j 


atrophin-1 interacting protein 1 ; activin receptor interacting p; KIAA0705 gene product | 


BAH -associated protein 1; WW domain-containing protein 3 | 


brain-specific angiogenesis inhibitor-associated protein 1 


Alternate: 


| Alternate: 


(Alternate: 


Alternate 


[Alternate: 


| Alternate: 


| Alternate: 


| Alternate: 
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i splice variant 2 

n protein 1 isoform PCM1 

in protein 1 isoform 3 

nnni7.i23 


five-lipoxygenase activating protein (FLAP) 


Rruppel-like factor 1 3; transcription factor NSLP1 ; novel Sp1 like zinc finger transcription factor; RANTES factor of late activated T 
ivmnhnr.vtfis-1 : basic transcription element binding protein 3 _ 


Alternate- similar to Krueppel-like factor 13 (Transcription factor BTEB3) (Basic transcription element binding piulem J) 

(BTE-binding protein 3) (RANTES factor of late activated T lymphocytes-1) (RFLAT-1) (Transcription factor NSLP1) (Novel 

Rn1 -like zinc fi — — 


hypothetical protein MGC 1779 1 




methyl-CpG binding domai 
, methyl-CpG binding proteii 
; methyl-CpG binding domai 
: methyl-CpG binding domai 
: methyl-CpG binding doma 

cat protein MGC17791 

; similar to RIKEN cDNA26 


Subclass: 
Subclass: 
Subclass' 
Subclass 
Subclass 

hypothetic 
I Alternate 
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cyclin-dependent kinase 3 | 


Pcdk2/Cyclin A In Complex With Mgadp, Nitrate and Peptide Substrate | 


: Cyclin-Dependent Kinase-2; Chain: A, C; Synonym: Cdk2; Ec: 2.7. 1.- | 


: Cell Division Protein Kinase 2; Chain: A; Synonym: Cyclin Dependent Kinase 2; Ec: 2.7.1 .37 | 


: PCTAIRE protein kinase 2; serine/threonine-protein kinase PCTAIRE-2; protein kinase cdc2-related PCTAIRE-2 | 


: Cdk5-P25(Nck5A) Complex - Protein Kinase II Catalytic Subunit, Cdk5 - Cdk5 Activator 1 , Cyclin-Dependent Kinase 5 
y Subunit 1, Protein Kinase II 23 Kda Subunit, Tpkii Regulatory Subunit, P23, P25, P35 




cytochrome P450, 

< 


Subclass: cytochrome P450, subfamily IVA, polypeptide 1 1; fatty acid omega-hydroxylase; P450HL-omega; alkane-1 
monooxygenase; lauric acid omega-hydroxylase (CYP4A1 1) 


Subclass: cytochrome P450, subfamily IVB, polypeptide 1; cytochrome P450, subfamily IVB, member 1; microsomal 
monooxygenase 


|Subclass: cytochrome P450, subfamily IVF, polypeptide 2; leukotriene B4 omega-hydroxylase; leukotriene-B4 20-monooxyqenas4 


Subclass: cytochrome P450, subfamily IVF, polypeptide 3; leukotriene B4 omega hydroxylase; leukotriene-B4 
20-monooxygenase; cytochrome P450-LTB-omega 


: cytochrome P450, subfamily IVF, polypeptide 1 1 | 


: Cytochrome P450 4F1 2 (CYPIVF1 2) | 


: cytochrome P450, subfamily IVF, polypeptide.8; microsomal monooxygenase; flavoprotein-linked monooxygenase I 


: similar to CYTOCHROME P450 4F6 (CYPIVF6) ! 


cytochrome P-450LTBV j 




Subclass: 


Subclass: 


Subclass: 


(Subclass; 


|Subclass: 


Subclass: 
Regulator 


Subclass: 


|Subclass; 


(Subclass; 


(Subclass: 


iSubclass: 
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winged helix/forkhead transcription factor 


Alternate: HNF-3/forkhead-like protein 1 


cartilage associated protein 




cytochrome P450, subfamily IIC 


Subclass: cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), polypeptide 18; cytochrome P450, subfamily IIC 
(mephenytoin 4-hydroxylase), polypeptide 17; microsomal monooxygenase; flavoprotein-linked monooxygenase 


Subclass: cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), polypeptide 19; mephenytoin 4-hydroxylase; microsoma 
monooxygenase; xenobiotic monooxygenase; flavoprotein-linked monooxygenase 


| Subclass: Cytochrome P450 2C8 (CYPIIC8) (P450 form 1 ) (P450 MP-12/MP-20) (P450 IIC2) (S-meohenvtoin 4-hvdroxvlase) I 


Subclass: cytochrome P450, subfamily IIC, polypeptide 9; cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), 
polypeptide 10; mephenytoin 4-hydroxylase; microsomal monooxygenase; xenobiotic monooxygenase; flavoprotein-linked 
monooxygenase 


ISubclass: Cytochrome P450 2C10 (CYPIIC10) (P450 MP-8) (S-mephenytoin 4-hydroxylase) (P-450MP) i 
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Alpha-Amylase 




amylase, aloha 2B; pancreatic; Amylase, pancreatic, alpha-2B . . J 


similar to Aloha-amvlase, salivary precursor (1 ,4-alpha-D-glucan glucanohydrolase) 


: amvlase. aloha 1A; salivary; Amylase, salivary, alpha- 1 A * _ 1 




CD36 antigen (collagen type I receptor, thrombospondin receptor); CD36 antigen (collagen type I) 




similar to uridine phosphorylase; similar to Q1 6831 (PID:g2494059) 


1 Alternate: Uridine phosphorvlase 




MHC class II histocompatibility antigen 


Isubclass: MHC class II histocompatibility antigen DQw1-beta chain precursor I 


ISubclass: MHC class II HLA-DQ-beta-1 I 


Subclass: 


Subclass: 


Subclass: 


Subclass 
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complement component 


; complement component C6 [ 


: similar to Complement component C6 precursor 1 


: complement C7 1 




deoxyribonuclease 1 


: deoxyribonuclease l-like 3 ] 


: DNase gamma j 


: deoxyribonuclease 1 j 


: deoxyribonuclease l-like 2 j 


: deoxyribonuclease l-like 1 j 


: DNL1L gene product j 




Low affinity immunoglobulin gamma FC region receptor ll-B precursor (FC-gamma Rll-B) (FCRII-B) (IGG FC receptor ll-B) 
(FC-gamma-RIIB) (CD32) (CDW32) 




aquaporin (water channel protein) 


Subclass: aquaporin 1 (channel-forming integral protein, 28kD) | 


|Subclass: 


Subclass; 


Subclass: 


| Subclass: 


Subclass: 


| Subclass: 


(Subclass: 


jSubclass: 


I Subclass: 
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egf-like module containing, mucin-like, hormone receptor-like sequence 2 isoform g j 


: egf-like module containing, mucin-like, hormone receptor-like sequence 2 isoform f I 


: egf-like module-containing mucin-like receptor 3 isoform a | 


: EGF-like module EMR2 [ 


lectomedin • ! 


: lectomedin-3 1 


; latrophilin 1; KIAA0.786 protein; lectomedin-1 ; latrophilin I 


: lectomedin-1 alpha I 


|vlectomedin-2 | 


: lectomedin-2; KIAA0821 protein j 


CD97 antigen, isoform 1 precursor; leukocyte antigen CD97; seven-span transmembrane protein | 


CD97 antigen, isoform 2 precursor; leukocyte antigen CD97; seven-span transmembrane protein j 




decay-acceleration factor 


Subclass: decay accelerating factor for complement (CD55, Cromer blood group system); Decay-accelerating factor of 
complement 


: decay-accelerating factor, splice form 1 I 


: decay-accelerating factor 1 ab | 


: decay-accelerating factor 4ab j 


: decay-accelerating factor 3 | 




Subclass: 


Subclass: 


|Subclass; 


Subclass; 


Alternate: 


| Subclass: 


| Subclass: 


jSubclass: 


| Subclass 


Alternate: 


Alternate: 


jSubclass 


|Subciass 


| Subclass 


jSubclass 
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CLAIMS 

1. A method of protecting a human subject from progression 
from a normoinsulinemic state to a hyperinsulinemic state, or 
5 from either to a type II diabetic state, which comprises 

administering to the subject a protective amount of at least one 
agent which is 

(1) a polypeptide which is substantially structurally identical 
10 or conservatively identical in sequence to a reference protein 

which is (a) selected from the group consisting of mouse and 
human proteins set forth in master table 1, subtables 1A and 1C, 
or (b) selected from the group consisting of human proteins 
within at least one of the human protein classes set forth in 
15 master table 2, subtables 2A and 2C, 

or 

(2) an expression vector encoding the polypeptide of (1) above 

2 0 and expressible in a human cell, under conditions conducive to 

expression of the polypeptide of (1) ; 

where said agent protects said subject from progression from a 
normoinsulinemic state to a hyperinsulinemic state, or from 
25 either to a type II diabetic state. 

2. A method of protecting a human subject from progression from 
a normoinsulinemic state to a hyperinsulinemic state, or from 
either to a type II diabetic state which comprises administering 

3 0 to the subject a protective amount of at least one agent which 

is 

(1) an antagonist of a polypeptide, occurring in said subject, 
which is substantially structurally identical or conservatively 
35 identical in sequence to a reference protein which is (a) 

selected from the group consisting of mouse and human proteins 
set forth in master table 1, subtable IB and 1C, or (b) selected 
from the group consisting of human proteins belonging to at 
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least one of the human protein classes set forth in master table 
2, subtables 2B and 2C, 

(2) an anti-sense vector which inhibits expression of said 
5 polypeptide in said subject, 

where said agent protects said subject from progression from a 
normoinsulinemic state to a hyper insulinemic state, or from 
either to a type II diabetic state. 

10 

3. A method of screening for human subjects who are prone 
to progression from a normoinsulinemic state to a 
hyperinsulinemic state, or from either to a type II diabetic 
state, which comprises assaying tissue or body fluid samples 

15 from said subjects to determine the level of expression of at 

least one "favorable" human marker gene, said human marker gene 
encoding a human protein which is substantially structurally 
identical or conservatively identical in sequence to a reference 
protein which is (a) selected from the group consisting of mouse 

2 0 and human proteins set forth in master table 1, subtables 1A and 

1C, or (b) selected from the group consisting of human proteins 
within at least one of the human protein classes set forth in 
master table 2, subtables 2A and 2C, 

25 and directly correlating the level of expression of said marker 
gene with the propensity to progression in said patient. 

4. A method of screening for human subjects who have a 
propensity for progression from a normoinsulinemic state to a 

30 hyperinsulinemic state, or from either to a type II diabetic 
state, which comprises assaying tissue or body fluid samples 
from said subjects to determine the level of expression of at 
least one "unfavorable" human marker gene, said human marker "' 
gene encoding a human protein which is substantially 

3 5 structurally identical or conservatively identical in sequence" 

to a reference protein which is (a) selected from the group 
consisting of mouse and human proteins set forth in master table 
1, subtable IB and 1C, or (b) selected from the group consisting 
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of human proteins belonging to at least one of the human protein 
classes set forth in master table 2, subtables 2B and 2C, 

and inversely correlating the level of expression of said marker 
5 gene with the propensity to progression in said patient. 

5. The method of claims 1 or 3 in which the reference protein is 
of subtable 1A or of a class set forth in subtable 2A. 

10 6. The method of claims 1 or 3 in which the reference protein is 
of subtable IB or of a class set forth in subtable 2B. 

7. The method of any one of claims 1-6 in which (a) applies. 

15 8. The method of any one of claims 1-7 in which the reference 
protein is a human protein. 

9. The method of any one of claims 1-7 in which the reference 
protein is a mouse protein. 

20 

10. The method of any one of claims 3 or 4 in which the level 
of expression of the marker protein is ascertained by measuring 
the level of the corresponding messenger RNA. 

11. The method of any one of claims 3 or 4in which the level of 
25 expression is ascertained by measuring the level of a protein 

encoded by said marker gene. 

12. The method of any one of claims 1-9 in which said 
polypeptide is at least 80% identical or at least highly 

3 0 conservatively identical to said reference protein. 

13. The method of any one of claims 1-10 in which said 
polypeptide is at least 90% identical to said reference protein. 

14. The method of any one of claims 1-11 in which said 
3 5 polypeptide is identical to said reference protein. 



15. The method of any one of claims 1-14 in which the E-value 
cited for the reference protein in Master Table 1 is not more 
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than e-6. 

16. The method of claim 15 in which the E-value cited for the 
reference protein in Master Table 1 is less than e-10. 

5 

17. The method of claim 17 in which the E value calculated by 
BLASTN or BLASTX would be less than e-15, more preferably less 
than e-20, still more preferably less than e-40, even more 
preferably less than e-60, considerably more preferably less 

10 than e-80, and most preferably less than e-100. 

18. The method of any of claims 2-17 in which the antagonist is 
an antibody, or an antigen-specific binding fragment of an 
antibody. 

15 

19. The method of any of claims 2-17 in which the antagonist is 
a peptide, peptoid, nucleic acid, or peptide nucleic acid 
oligomer. 

20 20. The method of any of claims 2-17 in which the antagonist is 
an organic molecule with a molecular weight of less than 500 
daltons. 

21. The method of claim 20 in which said organic molecule is 
25 identifiable as a molecule which binds said polypeptide by 
screening a combinatorial library. 
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